/purwadhika-final-project

Final Project for Data Science Program at Purwadhika Digital School

Primary LanguageJupyter NotebookMIT LicenseMIT

Project logo

Washington D.C. Residential Properties Price Prediction ๐Ÿ 

Status Code Size Contributors License


This web app uses actual transaction data and machine learning models to predict the prices of housing in Washington, D.C.. The transaction data contain attributes and transaction prices of residential property that respectively serve as independent variables and dependent variables for the machine learning models.

๐Ÿ“ Table of Contents

๐Ÿง Problem Statement

The housing market is one of the most crucial components of any national economy. Hence, observations of the housing market and accurate predictions of real estate prices are helpful for real estate buyers and sellers as well as economic specialists. It is crucial for the establishment of real estate policies and can help real estate owners and agents make informative decision. However, real estate forecasting is a complicated and difficult task owing to many direct and indirect factors that inevitably influence the accuracy of predictions.

๐Ÿ’ก Idea / Solution

The main idea is to train a regression model using historical transactions. The output of this model is the predicted price of a house given its features. It will also give the range of price by 95% predictions interval which means given a prediction of โ€˜yโ€™ given โ€˜xโ€™, there is a 95% likelihood that the range โ€˜aโ€™ to โ€˜bโ€™ covers the true outcome. In order to measure the performance of the model, we focus on Mean Absolute Error (MAE) because it is robust to outliers.

๐Ÿ Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

Make sure that you have Python 3.6 - Python 3.8 installed. The libraries required to run this project is in the requirements.txt. You can install them using PIP.

pycaret==2.3.2
catboost>=0.23.2
pandas==1.1.5
streamlit

Running

In order to run a streamlit app, all you need to do is write the following command.

streamlit run app.py

Thatโ€™s it! In the next few seconds the app will open in a new tab in your default browser.

โ›๏ธ Built With

Deployment

โœ๏ธ Authors

Special thanks to our mentor, @Muhammad who provided a lot of feedback and insights.

๐ŸŽ‰ Acknowledgments

This project uses an open source dataset, which include 48 explanatory features and 158,957 entries of housing sales in Washington, D.C from 1947 to 2018. The data is available at Open Data D.C. and the residential and address point data is managed by the Office of the Chief Technology Officer.