This repository is the accompanying repository for the Python Crash Course live training. You can find the dataset used in the data
folder, and the images used in this training in the assets
folder.
- Import data into Python using
pandas
— Python’s most popular data analysis package. - Filter, add new columns, and analyse datasets using pandas.
- Present data visualizations using
matplotlib
andseaborn
— Python's most popular data visualization packages.
The dataset to be used in this training is a CSV file named airbnb.csv
, which contains data on airbnb listings in the state of New York. It contains the following columns:
listing_id
: The unique identifier for a listingdescription
: The description used on the listinghost_id
: Unique identifier for a hostneighbourhood_full
: Name of boroughs and neighbourhoodscoordinates
: Coordinates of listing (latitude, longitude)listing_added
: Date of added listingroom_type
: Type of roomrating
: Rating from 0 to 5.price
: Price per night for listingnumber_of_reviews
: Amount of reviews receivedreviews_per_month
: Number of reviews per monthavailability_365
: Number of days available per yearnumber_of_stays
: Total number of stays thus far
- Question 1: What is the distribution of price per room type?
- Question 2: What is the number of listings per borough?
- Question 3: What is the number of listings per year?
- Question 4: What is the number of listings per year in each borough?