Data Preprocessing part 2: https://github.com/musama619/Data-Preprocessing
* Data Entry Error
* Sampling Error
* Measurement Error
Detect Outliers using EDD and visualizations using scatterplot, histogram, boxplot or jointplot
- Capping and Flooring Impute values above 3(p99) and below 0.3(P1)
- Other Methods
* Log Function
helpful_log = np.log(df.Helpful_Votes + 1)