Investigating-Netflix-Movies-and-Guest-Stars-in-The-Office

DataCamp First Project

Netflix! What started in 1997 as a DVD rental service has since exploded into the largest entertainment/media company by market capitalization, boasting over 200 million subscribers as of January 2021.

Given the large number of movies and series available on the platform, it is a perfect opportunity to flex my data manipulation skills and dive into the entertainment industry. My friend has also been brushing up on their Python skills and has taken a first crack at a CSV file containing Netflix data. For their first order of business, they have been performing some analyses, and they believe that the average duration of movies has been declining.

As evidence of this, they have provided me with the following information. For the years from 2011 to 2020, the average movie durations are 103, 101, 99, 100, 100, 95, 95, 96, 93, and 90, respectively.

If I am going to be working with this data, I know a good place to start would be to probably start working with pandas. But first I'll need to create a DataFrame from scratch. I will start by creating a Python object covered in Intermediate Python: a dictionary!