/crawler

Website crawler and scraper

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

crawler

Website crawler and scraper

version

  1. imdb/ folder
    • scrape the IMDb 50 highest rated movies
    • crawl the IMDb top 250 movies
  2. factbase/ folder
    • crawl the list of Donald Trump's speeches and interviews (a website with scroll-to-load feature), and scrape the transcripts

Author

Zhongyu Chen