This is a simple script that reads in an csv and plots a histgram about a column that has numbers.
It will also plot mean and median and 1Q and 2Q and all the good stuff.
-
Download the script, but it in the same folder as your csv.
-
Make sure the column you want to analyze has only numbers (get rid of things like "%")
-
Use command line to navigate to that folder, then type
python get_dataset_shape.py
-
The command line is going to ask you which file you'd like to analyze, type in the filename, include extensions like ".csv"
-
The command line is also going to ask you which column you'd like to analyze, type in the column name.
-
Wait for magic to happen. Here is an example: