Renovation webpage in Russian
Some reads in English
Renovation - is a resettlement project of people who lived in low-cost apartament buildings, comonly named "Khrushevki", which was developed during the early 1960s Old buildings will be demolished and the new apartment will rise on their place.
Dataset1: 33 thousands buildings with differrent paraments(area, year of construction, etc) and ~4 thousands is in renovation programm
For more precise analysis of studied type of building ("khrushevki") and area around them 5 districts were selected (3 with a lot of old building, 2 wihtout them)
Dataset2: 5 thousands buildings from chosen districts, subsets of Dataset1
Dataset3: 4kk posts with geotags from vk.com from March 2017 to Augest 2017
2 classes: in renovation program or not
Classification was done in order to estimate weights vector and find most important features. 3 different classification models have been used: logic regression, decision tree and boosting. Models was traind with cross-validation on all data.
f1_score values:
Models/Datasets | 33k | 5k |
---|---|---|
Log.regr | 0.61 | 0.89 |
DeсisionTree | 0.71 | 0.91 |
Boosting | 0.79 | 0.93 |
In oreder to investigate the areas, where people speak about renovation word2vec was traind on data from vk. Using top50 words similar to "renovation" 2000 post was selected. This words was also used to build a cloud of words. After that LDA model was trained to compare the result topics with "rennovation" topic.
More info could be found in the notebook