/Releasing_Action_movies

This project gives analysis on when best to release Action movies and along with other factors to consider when expecting great return from your Action movie.

Primary LanguageJupyter Notebook

Releasing_Action_movies

Project: The Movie Database Analysis

Table of Contents

## Introduction

The Movie Database is home to a lot of metrics concerning movies. These metrics include the movies budget, revenue, popularity ,title, cast, reviews, runtime etc. In this project i am working with a dataset that contains 10866 movies. I wish to analyze and point out the best month and day to release Action movies based on total movie revenue for each month and day and also based on the budget used to produce such movie.

Question(s) posed

  • When is the best time to release action movies?
  • Does budget correlate with revenue?
  • How long should an Action movie be?

How answers were arrived at

The revenue and release date are the major variables used to arive at the answer in this project. The release date with the highest revenue was used to determine the best time to release an Action movie since the success of a movie is generally measured by its revenue. Any action movie producer would love this advice.

Limitation

The major limitation is that production companies try as much to withold budget for movies and as such it is difficult to get the budget for some movies and also some of the budgets are estimated sums.

Data Wrangling

In this section of the project, i load in the data, check for cleanliness and tidiness issues in the data, and then trim and clean the dataset for analysis.Steps are clearly documented and justified.

General Properties(Inspection)

I used the .shape, .info(), .describe(), .isnull() and .duplicated() to quickly inspect the data.

Assesment

Quality issues

  • Null values exist in the dataset
  • Release year should be changed to datetime type
  • Drop duplicates
  • Movies with budget less than $$60000$ and revenue less than $$368000$

Tidiness issues

  • genre column has more than one variable

Data Cleaning

Quality and Tidiness issues Here i addressed all issues stated in the assessment.

Exploratory Data Analysis

After this i performed Exploratory Data Analysis to come to the conclusions

Conclusions

From the analysis made on the movie database on when to release Action movies with the Revenue as the determining factor(variable) it is advisory to release an Action Movie in May and preferably on a Wednesday as this month proves to produce the heighest revenue of over 160m dollars over the years from 1966 -2015. It will also do good to spend heavily in action movies as shown by the analysis over the years in other to get more revenue as there is a correlation between between budget and revenue. Finally Action movies that have a runtime of about 100mins make more revenues.