/tmdb_movies


TMDB Dataset

Here is the Medium blog post I have written: https://medium.com/@cp.josejesus/tmdb-movies-dataset-analysis-11ebf31eb6cd

Project Motivation

This project (Write a Data Science Blog Post) is part of the Data Scientists Program.

I used the TMDB Movies Dataset for this project. TMDB is a popular database for movies and TV shows. The original dataset can be found here: https://www.kaggle.com/datasets/tmdb/tmdb-movie-metadata

This project focuses on answering the following questions:

  • What are the 10 most popular movies?
  • Which 10 films had the biggest budget?
  • Which 10 films had the highest revenue?
  • Which actors appeared in the most movies?
  • What are the 10 most used genres?
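The questions above mostly reduce to two pandas operations: picking the rows with the largest values in a numeric column, and counting individual entries in a multi-valued column. The following is a minimal sketch, not the notebook's actual code. The column names (`original_title`, `popularity`, `budget`, `revenue`, `cast`, `genres`), the pipe-separated format of `cast` and `genres`, and the helper names `top_n` and `count_pipe_values` are all assumptions made for illustration, and the sample rows are made up.

```python
import pandas as pd


def top_n(df, value_col, n=10, label_col="original_title"):
    """Return the n rows with the largest values in value_col.

    label_col is an assumed name for the title column.
    """
    return df.nlargest(n, value_col)[[label_col, value_col]]


def count_pipe_values(df, col, n=10):
    """Count individual entries in a column assumed to hold
    pipe-separated strings such as 'Action|Drama'."""
    values = df[col].dropna().str.split("|").explode().str.strip()
    return values.value_counts().head(n)


# Made-up sample rows; the real data would come from the CSV file.
movies = pd.DataFrame({
    "original_title": ["Film A", "Film B", "Film C"],
    "popularity": [3.2, 9.1, 5.4],
    "budget": [100, 250, 50],
    "revenue": [300, 200, 900],
    "cast": ["Ann|Bob", "Bob|Cy", "Bob|Ann"],
    "genres": ["Action|Drama", "Drama", "Comedy|Drama"],
})

print(top_n(movies, "popularity", n=2))      # most popular movies
print(top_n(movies, "budget", n=2))          # biggest budgets
print(top_n(movies, "revenue", n=2))         # highest revenue
print(count_pipe_values(movies, "cast", n=2))    # busiest actors
print(count_pipe_values(movies, "genres", n=2))  # most used genres
```

If the genre and cast columns turn out to be stored in another format (for example JSON strings), only `count_pipe_values` would need to change.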

Libraries

This project uses Python 3. Here are the libraries I used in my Jupyter Notebook:

  • NumPy

  • Pandas

  • Seaborn

  • matplotlib.pyplot

  • tmbd-movies.csv: the original dataset in CSV format
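As a rough illustration of how these libraries fit together, the sketch below draws a horizontal bar chart of the top rows for one numeric column. It uses pandas with plain matplotlib rather than Seaborn, and it is not taken from the notebook: the helper name `plot_top`, the column names, and the sample rows are assumptions. With the real data, the frame would presumably be loaded with `pd.read_csv("tmbd-movies.csv")` instead.

```python
import pandas as pd
import matplotlib.pyplot as plt


def plot_top(df, label_col, value_col, n=10):
    """Draw a horizontal bar chart of the n largest values in value_col.

    Returns the matplotlib Axes so the caller can tweak or save the figure.
    """
    # Sort ascending so the largest bar ends up at the top of the chart.
    top = df.nlargest(n, value_col).sort_values(value_col)
    fig, ax = plt.subplots(figsize=(8, 5))
    ax.barh(top[label_col], top[value_col])
    ax.set_xlabel(value_col)
    ax.set_title(f"Top {len(top)} by {value_col}")
    fig.tight_layout()
    return ax


# Made-up sample rows standing in for the CSV contents.
# Assumed real usage: sample = pd.read_csv("tmbd-movies.csv")
sample = pd.DataFrame({
    "original_title": ["Film A", "Film B", "Film C"],
    "revenue": [300, 200, 900],
})

ax = plot_top(sample, "original_title", "revenue", n=2)
```

In a Jupyter Notebook the figure is displayed when the cell finishes; in a script, call `plt.show()` or `ax.figure.savefig(...)` afterwards.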