This project scrapes all movie information from the AlloCiné website.

Primary language: Python

🎦 allocine_scrapping_data

📖 Description

AlloCiné is a French cinema information service that publishes press and user reviews for a large number of films. This code makes it possible to scrape all of the movie information listed there.

🗃️ The Visualization Data

🎉🎉🎉 Update: a visualization of the AlloCiné data analysis is available here

🎉🎉🎉 Update: the source code of the Streamlit data visualization is available here

📝 Description of the data

We provide the dataset in CSV format, in raw ("brute") and cleaned versions: allocine_movies_brute.csv

ℹ️ The Columns :

  • title: the movie's title (in French)
  • release_year: the original release date
  • re_release_date: the re-release date
  • duration: the movie's length
  • genres: the movie's genres (as an array, up to three different genres)
  • directors: the movie's directors (as an array)
  • actors: the movie's main actors (as an array)
  • nationality: the movie's nationalities (as an array)
  • press_rating: press rating (from 0 to 5 stars)
  • nb_press_vote: number of press votes
  • spec_rating: AlloCiné user rating (from 0 to 5 stars)
  • nb_user_vote: number of user votes
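
The column list above can be used to sanity-check a CSV file before working with it. The following is a minimal sketch, not part of the project's code: the helper name `read_movies` and the sample row are hypothetical, and it assumes the file has a header row using exactly the column names listed above. How array columns such as genres are serialized inside a cell is not specified here, so the sketch leaves every value as a plain string.

```python
import csv
import io

# Column names as listed in this README.
EXPECTED_COLUMNS = [
    "title", "release_year", "re_release_date", "duration", "genres",
    "directors", "actors", "nationality", "press_rating", "nb_press_vote",
    "spec_rating", "nb_user_vote",
]


def read_movies(handle):
    """Read movie rows from an open CSV file and check the header.

    Hypothetical helper: returns a list of dicts, one per row, with all
    values kept as strings.
    """
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


# Tiny in-memory sample with made-up values, only to show the call.
sample = io.StringIO(
    ",".join(EXPECTED_COLUMNS) + "\n"
    "Exemple,2000,,100,Drame,Some Director,Some Actor,France,3.5,10,4.0,200\n"
)
rows = read_movies(sample)
print(rows[0]["title"], rows[0]["press_rating"])
```

With the real dataset, the same function could be called on a file opened with `open("allocine_movies_brute.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption and may need adjusting.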

🚀 Getting Started

streamlit run app.py