A simple script to check a given dataset index file and compute simple statistics and fix the dataset for small problems like skewness.
I did it to practice python but now I use pandas and scikit-learn to do it more easily for sure! with sub/over sampling.