Introduction

Airbnb is an online marketplace for arranging or offering lodging, primarily homestays, or tourism experiences. Since 2008, guests and hosts have used Airbnb to expand on traveling possibilities. It is a more unique, personalized way of experiencing the world and meeting with new people.

This dataset (from Kaggle) describes the listing activity and metrics in NYC, NY for 2019. This data file includes all needed information to find out more about hosts, geographical availability, necessary metrics to make predictions and draw conclusions.

The analysis could provide some insights on the hosts and areas, answering questions like whether locations, room type, reviews would affect price.

Packages

matplotlib seaborn Counter pandas numpy sklearn

Conclusions

  • Price is highly right skewed.
  • Room type and neighbourhood would affect the price.
  • High price lists are clustered in Manhattan, and the closer to Manhattan, generally yield a higher price.
  • Lists have lower price like to use 'cozy', lists are more expensive like to use 'studio' and 'apartment'.

Resources

https://www.kaggle.com/dgomonov/new-york-city-airbnb-open-data