Data Preprocessing part 1: https://github.com/musama619/DataPreProcessing
- OneHotEncoder
- LabelEncoder
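1. Standardisation : (x - mean) / standard deviation, so each feature has mean 0 and standard deviation 1 (values mostly between -3 and 3)
2. Normalisation : (x - min) / (max - min), values between 0 and 1 (recommended only when features follow a normal distribution)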
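- remember to apply feature scaling after the train/test split: fit the scaler on the training set only, then use it to transform the test set, so no information from the test set leaks into training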
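- feature scaling does not need to be applied to dummy variables: they only take the values 0 and 1, which already lie within the roughly -3 to 3 range of standardised features, and scaling them would lose their meaning as category indicators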
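
The points above can be sketched with scikit-learn roughly as follows. This is a minimal illustration, not the code from the linked repository: the toy data and all variable names are made up, and StandardScaler / MinMaxScaler are assumed to be the scalers meant by "standardisation" / "normalisation".

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import (
    LabelEncoder,
    MinMaxScaler,
    OneHotEncoder,
    StandardScaler,
)

# Hypothetical toy data: one categorical feature and two numeric ones (age, salary).
countries = np.array(
    [["France"], ["Spain"], ["Germany"], ["Spain"],
     ["Germany"], ["France"], ["Spain"], ["France"]]
)
numeric = np.array(
    [[44, 72000], [27, 48000], [30, 54000], [38, 61000],
     [40, 63000], [35, 58000], [52, 83000], [48, 79000]],
    dtype=float,
)
y = np.array(["No", "Yes", "No", "No", "Yes", "Yes", "No", "Yes"])

# OneHotEncoder: one 0/1 dummy column per category (France, Germany, Spain).
dummies = OneHotEncoder().fit_transform(countries).toarray()
X = np.hstack([dummies, numeric])  # columns 0-2 are dummies, 3-4 are numeric

# LabelEncoder: turn the target labels into integers ("No" -> 0, "Yes" -> 1).
label_encoder = LabelEncoder()
y_encoded = label_encoder.fit_transform(y)

# Split first, scale afterwards.
X_train, X_test, y_train, y_test = train_test_split(
    X, y_encoded, test_size=0.25, random_state=0
)

# Standardisation: fit on the training set only, reuse the fitted scaler on
# the test set. The dummy columns (0-2) are left untouched.
scaler = StandardScaler()
X_train_std = X_train.copy()
X_test_std = X_test.copy()
X_train_std[:, 3:] = scaler.fit_transform(X_train[:, 3:])
X_test_std[:, 3:] = scaler.transform(X_test[:, 3:])

# Normalisation alternative: rescale the numeric columns to the range [0, 1].
minmax = MinMaxScaler()
X_train_norm = X_train.copy()
X_test_norm = X_test.copy()
X_train_norm[:, 3:] = minmax.fit_transform(X_train[:, 3:])
X_test_norm[:, 3:] = minmax.transform(X_test[:, 3:])
```

Note that the test set is only ever passed to transform, never to fit_transform, so its statistics do not influence the scaler. Test values can therefore fall slightly outside [0, 1] after normalisation.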
Important link: