Predicting USA house / real estate prices using Linear Regression (via scikit-learn)
- load Pandas DataFrame containing housing data retrieved.
- do some simple data exploration / visualisation
- remove non-numeric data.
- split the data in train and test sets (+ normalise independent variables where required)
- fit the model
- evaluate estimator performance
The data set I have used is virtually created and is not actual. So the prices may not reflect as real.