Here is the Medium blog post I have written: https://medium.com/@cp.josejesus/tmdb-movies-dataset-analysis-11ebf31eb6cd
Project Motivation
This project (Write a Data Science Blog Post) is part of the Data Scientists Program.
I used the TMDB Movies Dataset for this project. TMDB is a popular database for movies and TV shows. The original dataset can be found here: https://www.kaggle.com/datasets/tmdb/tmdb-movie-metadata
This project focuses on answering the following questions:
- What are the 10 most popular movies?
- Which 10 films had the biggest budget?
- Which 10 films had the highest revenue?
- Which actors appeared in the most movies?
- What are the 10 most used genres?
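The questions above come down to two Pandas patterns: ranking rows by a numeric column, and counting values in a multi-valued column. Here is a minimal sketch of both, not the notebook's actual code. The small in-memory table, the column names (`title`, `popularity`, `genres`), the `|` separator for genres, and the helper names `top_n` and `most_common` are all assumptions made for illustration; the real CSV may be laid out differently.

```python
import pandas as pd

# Tiny stand-in for the real dataset. Column names and values are
# illustrative assumptions, not taken from the actual CSV.
movies = pd.DataFrame({
    "title": ["A", "B", "C", "D"],
    "popularity": [3.5, 9.1, 1.2, 7.4],
    "genres": ["Action|Drama", "Drama", "Comedy|Drama", "Action"],
})


def top_n(df, column, n=10):
    """Return the n rows with the largest values in `column` (hypothetical helper)."""
    return df.nlargest(n, column)[["title", column]]


def most_common(df, column, n=10, sep="|"):
    """Count the values of a delimiter-separated column and keep the n most frequent."""
    return df[column].str.split(sep).explode().value_counts().head(n)


# Ranking pattern: most popular movies (same idea for budget and revenue).
most_popular = top_n(movies, "popularity", n=2)

# Counting pattern: most used genres (same idea for actors in a cast column).
genre_counts = most_common(movies, "genres", n=2)

print(most_popular)
print(genre_counts)
```

With the real data, the same helpers would be called with `n=10` on whichever columns hold popularity, budget, revenue, cast, and genres.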
Libraries
This project uses Python 3. Here are the libraries I used in my Jupyter Notebook:
- NumPy
- Pandas
- Seaborn
- matplotlib.pyplot
tmbd-movies.csv: Original dataset in CSV format