Project-2---Movie-Analysis-Udacity

This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue.

Project Overview

This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue. Certain columns, like ‘cast’ and ‘genres’, contain multiple values separated by pipe (|) characters. There are some odd characters in the ‘cast’ column; they do not need to be cleaned and can be left as is. The final two columns, ending with “_adj”, show the budget and revenue of the associated movie in 2010 dollars, accounting for inflation over time.
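As a rough illustration of handling the pipe-separated columns, the sketch below splits a ‘genres’ cell into individual values and counts them using only the Python standard library. The sample rows and the `original_title` key are hypothetical and not taken from the actual file; in a pandas notebook the same idea is usually expressed with `Series.str.split("|")` followed by `explode()`.

```python
from collections import Counter

# Hypothetical sample rows; the real file has many more columns,
# and 'original_title' is an assumed column name.
rows = [
    {"original_title": "Movie A", "genres": "Action|Adventure|Science Fiction"},
    {"original_title": "Movie B", "genres": "Action|Thriller"},
    {"original_title": "Movie C", "genres": ""},  # a missing genre list
]


def split_pipe(value):
    """Split a pipe-separated cell into a list, skipping empty entries."""
    return [part.strip() for part in (value or "").split("|") if part.strip()]


# Count how often each individual genre appears across all rows.
genre_counts = Counter(g for row in rows for g in split_pipe(row["genres"]))
print(genre_counts.most_common(1))  # [('Action', 2)]
```

Treating an empty or missing cell as an empty list keeps rows without genres from producing a spurious blank category.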

EDA

  • Which genres are most popular from year to year?
  • Which movie genre is most popular overall?
  • Which production company produces the most movies?
  • Which movie has the highest revenue?
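A minimal sketch of how two of these questions might be approached is shown below. It is not the notebook's actual analysis: the sample records, the column names `title`, `release_year`, `popularity`, and `revenue`, and the choice to measure "most popular" as the highest mean `popularity` per genre are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical sample records; every column name except 'genres' is an assumption.
movies = [
    {"title": "Movie A", "release_year": 2014, "genres": "Action|Adventure", "popularity": 8.0, "revenue": 500},
    {"title": "Movie B", "release_year": 2014, "genres": "Adventure|Drama", "popularity": 3.0, "revenue": 120},
    {"title": "Movie C", "release_year": 2015, "genres": "Drama|Thriller", "popularity": 6.0, "revenue": 900},
    {"title": "Movie D", "release_year": 2015, "genres": "Action|Thriller", "popularity": 2.0, "revenue": 50},
]


def top_genre_by_year(records):
    """Return {year: genre}, ranking genres by mean popularity within each year."""
    scores = defaultdict(lambda: defaultdict(list))
    for rec in records:
        # A movie with several genres contributes its popularity to each of them.
        for genre in filter(None, rec["genres"].split("|")):
            scores[rec["release_year"]][genre].append(rec["popularity"])
    return {
        year: max(genres, key=lambda g: sum(genres[g]) / len(genres[g]))
        for year, genres in scores.items()
    }


def highest_revenue(records):
    """Return the record with the largest revenue value."""
    return max(records, key=lambda rec: rec["revenue"])


print(top_genre_by_year(movies))         # {2014: 'Action', 2015: 'Drama'}
print(highest_revenue(movies)["title"])  # Movie C
```

Other definitions of popularity, such as the number of releases per genre or the average user rating, would slot into the same structure by changing the value that is collected and the ranking function.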

Data

This data set comes from Kaggle and can be found here: https://www.kaggle.com/tmdb/tmdb-movie-metadata/code