/Data_Cleaning

Primary LanguageJupyter Notebook

Data Exploration and Data Cleaning Assignment

We worked with the Kaggle competition's dataset on property sales price estimation in Melbourne, Australia, specifically with the reduced data set produced by DanB. In the first part, we performed exploratory data analysis, outlier detection, feature selection and creation of new features by merging with an AirBnB dataset. In the second part, we encoded the variables, tried different imputation methods for the columns with missing values and applied PCA to reduce the dimensionality