This project predicts the prices of houses located in the cities of the US, with the help of essential features.
The dataset which is used here, is collected from Kaggle website. Here is the link of the dataset : https://www.kaggle.com/shree1992/housedata
The goal of this project is to make a prediction model which will predict the prices of the houses using different parameters.
- Importing all the required libraries. Check
requirements.txt
. - Upload the dataset and the Google Collab file.
- Exploratory Data Analysis
- Data Processing
- Prediction Models
- Linear Regression
- Random Forest Regression
- Lasso Regression
- OLS Regression
- Ridge Regression
- Bayesian Regression
- ElasticNet Regression
- Model Comparison
- Conclusion
- Numpy
- Pandas
- Matplotlib
- Sklearn
- Seaborn
- Pairplot
- Distplot
- Average price of houses depending on no. of bedrooms
- Pearon Correaltion matrix
Model-Name | Training Accuracy | Testing Accuracy |
---|---|---|
Linear Regresssion | 40% | 52% |
Random Forest | 88% | 15% |
Lasso Regresssion | 39% | 52% |
- Therefore, Random Forest works the best for the following dataset.
Code Contributed by Vidhi Bhatt, 2021 @VidhiBhatt01 #GWOC21