The California housing dataset contains information from the 1990 California census. The summary statistics of California housing are described by the 10 columns below:
longitude
latitude
housing_median_age
total_rooms
total_bedrooms
population
households
median_income
median_house_value
ocean_proximity
The data requires pre-processing (for example, total_bedrooms has missing values and ocean_proximity is a text category rather than a number) and lends itself well to feature engineering. It is a good example for walking through all the steps required to build a machine learning model.
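As a rough illustration of the pre-processing and feature engineering mentioned above, here is a minimal sketch using pandas. The rows are made-up toy values, not taken from the real dataset, and the helper name preprocess and the derived column names (rooms_per_household, bedrooms_per_room, population_per_household) are hypothetical choices for this example. Median imputation and one-hot encoding are just one reasonable option among several.

```python
import pandas as pd

# Toy rows with made-up values so the sketch runs on its own;
# they are NOT real records from the California housing dataset.
df = pd.DataFrame({
    "total_rooms": [800.0, 1500.0, 1200.0, 600.0],
    "total_bedrooms": [200.0, None, 300.0, 100.0],
    "population": [400.0, 900.0, 600.0, 300.0],
    "households": [160.0, 300.0, 240.0, 100.0],
    "ocean_proximity": ["NEAR BAY", "INLAND", "INLAND", "NEAR OCEAN"],
})


def preprocess(frame: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: impute, derive ratio features, encode categories."""
    out = frame.copy()

    # Pre-processing: fill missing bedroom counts with the column median.
    median_bedrooms = out["total_bedrooms"].median()
    out["total_bedrooms"] = out["total_bedrooms"].fillna(median_bedrooms)

    # Feature engineering: raw totals depend on area size, so ratios
    # are often more informative than the totals themselves.
    out["rooms_per_household"] = out["total_rooms"] / out["households"]
    out["bedrooms_per_room"] = out["total_bedrooms"] / out["total_rooms"]
    out["population_per_household"] = out["population"] / out["households"]

    # Pre-processing: one-hot encode the text category.
    out = pd.get_dummies(out, columns=["ocean_proximity"], prefix="ocean")
    return out


features = preprocess(df)
print(features.head())
```

With the real data, the same function could be applied to the DataFrame loaded from the dataset file, ideally computing the median on the training split only so that no information leaks from the test split.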