Car price prediction

This school project is about predicting second hand cars using machine learning. This project needed skill in Machine learning (linear regression, NLP), data cleaning, and feature engineering. Files :

Main files are at the roots of the repo
EDA is in the notebook folder
autopluspy is a custom python library made for this project

Getting started

git clone the project
create a virtualenv

virtualenv -p python3 venv

Install dependencies

pip install -r requirements.txt

Put the initial dataset into /data folder
Run the jupyter notebook Runbook (available at the roots of the repo) to launch the whole system. Uncomment the last cell if you want to start the streamlit app

Architecture and features

Data Engineering

Input:

Initial dataset
Eventually new dataset

Process:

Output

Processed dataset
Data Dictionary

Machine Learning

Input:

Dataset
Data Dictionary

process:

Output:

Regression Model
Std model
Features needed for prediction with possible value

App

Input:

Data Dictionary
Features list needed for the prediction

Interaction :

form
display prediction and price tuning range
how this car price is considering others cars price (good deal or not)

Cheatsheet Streamlit

## User input : text
Model_year = st.text_input('Model_year', '2010')

charlespv/car-price-prediction-ml

Car price prediction

Getting started

Architecture and features

Data Engineering

Machine Learning

App

Cheatsheet Streamlit