Primary language: Python. License: MIT.

Movie-Spider

Project Objectives

  • This project crawls a data set that can be used by recommendation algorithms. It mainly covers users, movies (movie type, rating, release time, director, actors), user comments, and comment times.
  • Target crawling website: Douban
  • The desired data set structure:
    • (movie ID, movie type, release time, movie label)
    • (user ID, user rating, rating time, rating location)
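
The two tuples above could be modeled as typed records. The following is only a minimal sketch: the class names, field names, field types, and sample values are assumptions made for illustration and are not taken from the project's code.

```python
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass
class MovieRecord:
    """One row of (movie ID, movie type, release time, movie label)."""
    movie_id: str          # assumed to be a string identifier
    movie_type: str        # e.g. a genre name
    release_time: str      # kept as text here; the real format is unknown
    labels: List[str]      # a movie may carry several labels (assumption)


@dataclass
class RatingRecord:
    """One row of (user ID, user rating, rating time, rating location)."""
    user_id: str
    rating: float
    rating_time: str
    rating_location: Optional[str] = None  # may be missing (assumption)


# Placeholder values, not real crawled data.
movie = MovieRecord(
    movie_id="m-001",
    movie_type="drama",
    release_time="1994-09-10",
    labels=["classic"],
)
rating = RatingRecord(user_id="u-001", rating=4.5, rating_time="2020-01-01 12:00:00")

print(asdict(movie))
print(asdict(rating))
```

Converting each record with `asdict` gives a plain dictionary, which is convenient if the rows are later written out as CSV or JSON.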

Basic knowledge required

  • Some front-end background
    • Understand the basic structure of a web page
    • Be familiar with HTML/CSS/JavaScript/HTTP
  • Basic Python programming skills
    • Install Python 3 and confirm that it runs
    • Know how to install third-party libraries
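
A quick way to confirm the Python prerequisites is a small environment check. This is a hedged sketch: `check_environment` is a hypothetical helper, and the module names passed to it are placeholders, since the libraries this project needs are not listed here.

```python
import importlib.util
import sys


def check_environment(required_modules):
    """Return the names in required_modules that cannot be imported.

    Raises RuntimeError if the interpreter is not Python 3.
    """
    if sys.version_info < (3,):
        raise RuntimeError("Python 3 is required")
    # find_spec returns None when a top-level module is not installed.
    return [
        name
        for name in required_modules
        if importlib.util.find_spec(name) is None
    ]


# "json" ships with Python; the second name is deliberately nonexistent.
missing = check_environment(["json", "hypothetical_missing_module_xyz"])
print("Missing modules:", missing)
```

Any name reported as missing would typically be installed with `python -m pip install <package>`.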