InmoElizaAnalisys

Preliminary analysis to gather information for a machine learning algorithm that predicts house prices in Belgium.

DATA

We are working with a dataset created by scraping the website inmoweb.be, running queries to collect the prices of apartments and houses around Belgium. The dataset was cleaned by applying the following steps:

  1. Drop all entirely empty rows.
  2. Strip the blank spaces at the beginning and end of each string.
  3. Fix errors: set the variable hasFullyEquippedKitchen to '1' if the apartment has a 'HYPER_EQUIPPED' or 'USA_HYPER_EQUIPPED' kitchen, and to '0' otherwise.
  4. Drop rows whose price is NaN.
  5. Drop duplicates.
  6. Delete the constant variable 'typeSale'.
  7. Fill empty values with np.NaN.
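The cleaning steps above can be sketched with pandas roughly as follows. This is a minimal illustration and not necessarily the code used in the repository: the column name `kitchenType` and the function name `clean_listings` are assumptions, while `price`, `typeSale` and `hasFullyEquippedKitchen` come from the list above.

```python
import numpy as np
import pandas as pd


def clean_listings(df: pd.DataFrame) -> pd.DataFrame:
    """Apply the seven cleaning steps, in the order listed, to a raw table."""
    df = df.copy()

    # 1. Drop rows in which every column is empty.
    df = df.dropna(how="all")

    # 2. Strip leading and trailing whitespace from every string value.
    for col in df.columns:
        df[col] = df[col].map(lambda v: v.strip() if isinstance(v, str) else v)

    # 3. Recode the kitchen flag (assumes a hypothetical 'kitchenType' column).
    if "kitchenType" in df.columns:
        fully_equipped = ["HYPER_EQUIPPED", "USA_HYPER_EQUIPPED"]
        df["hasFullyEquippedKitchen"] = (
            df["kitchenType"].isin(fully_equipped).astype(int)
        )

    # 4. Drop rows without a price.
    df = df.dropna(subset=["price"])

    # 5. Drop duplicated rows.
    df = df.drop_duplicates()

    # 6. Drop the constant column 'typeSale' if it is present.
    df = df.drop(columns=["typeSale"], errors="ignore")

    # 7. Represent empty strings as NaN.
    df = df.replace("", np.nan)

    return df.reset_index(drop=True)
```

Calling `clean_listings(raw_df)` on the scraped table returns a new DataFrame and leaves the input untouched.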

image

VISUALIZATION

We first visualize the relationship between the variables and the target (price) in a heatmap:

image

  1. The three variables with the highest correlation are:

     Variable          Correlation
     area              0.58
     BedroomsCount     0.40
     hasSwimmingPool   0.35

  2. The three variables with the highest correlation are:

     Variable          Correlation
     hasSwimmingPool   0.11
     hasGarden         0.07
     isFurnished       0.01
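A ranking like the ones above can be obtained from the correlation matrix that feeds the heatmap. The sketch below assumes Pearson correlation over the numeric columns and a target column named `price`; the helper name `correlation_with_target` is hypothetical, and the README does not state which correlation method was used.

```python
import pandas as pd


def correlation_with_target(df: pd.DataFrame, target: str = "price") -> pd.Series:
    """Return each numeric variable's correlation with the target, sorted."""
    # Keep only numeric columns; text columns cannot be correlated directly.
    numeric = df.select_dtypes(include="number")
    # Full pairwise Pearson correlation matrix (the data behind a heatmap).
    corr_matrix = numeric.corr()
    # Take the target's column and remove its trivial self-correlation of 1.
    with_target = corr_matrix[target].drop(target)
    return with_target.sort_values(ascending=False)
```

The first three entries of the returned Series are the most positively correlated variables. The full matrix from `numeric.corr()` can be passed to a plotting function such as `seaborn.heatmap` to draw the heatmap.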

Then we explore the behavior of the house prices in relation to the area. The data were transformed by taking the logarithm of the price and the square root of the area.

image
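The transformation described above could look like the following sketch. It assumes the natural logarithm (the base is not stated in this README) and columns named `price` and `area`; the new column names `log_price` and `sqrt_area` are illustrative.

```python
import numpy as np
import pandas as pd


def add_transformed_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Add log(price) and sqrt(area) columns without modifying the input."""
    out = df.copy()
    # Natural log compresses the long right tail of the price distribution.
    out["log_price"] = np.log(out["price"])
    # Square root reduces the influence of very large properties.
    out["sqrt_area"] = np.sqrt(out["area"])
    return out
```

Both transforms require positive inputs, so rows with a missing or non-positive price or area should be filtered out beforehand.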

Some questions to answer:

We also looked at the average price per square meter in the different regions of Belgium (Brussels, Wallonia, Flanders) and in Belgium as a whole.
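One way to compute these averages is sketched below. It assumes a column named `region` holding the region of each listing (a hypothetical name) and takes the mean of the per-listing price/area ratios. Dividing total price by total area per region is an equally plausible definition and would give different numbers.

```python
import pandas as pd


def mean_price_per_sqm(df: pd.DataFrame, region_col: str = "region") -> pd.Series:
    """Mean price per square meter for each region and for Belgium overall."""
    # Keep listings with a known price and a strictly positive area.
    valid = df[(df["area"] > 0) & df["price"].notna()]
    # Price per square meter of each individual listing.
    per_sqm = valid["price"] / valid["area"]
    # Average the per-listing values within each region.
    by_region = per_sqm.groupby(valid[region_col]).mean()
    # Add the country-wide average over all listings.
    by_region["Belgium"] = per_sqm.mean()
    return by_region
```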

Brussels

Least expensive houses

image

Most expensive houses

image

Wallonia

Least expensive houses

image

Most expensive houses

image

Flanders

Least expensive houses

image

Most expensive houses

image

Belgium

Least expensive houses

image

Most expensive houses

image

CONCLUSIONS

We can see that Wallonia has considerably less expensive houses and apartments than Brussels or Flanders, while Flanders has the most expensive houses.

At the same time, the mean price per square meter is highest in Brussels.

image