/scrapfilm

Screen Scraper for IMDB

Primary language: JavaScript. License: MIT.

IMDB Top 250 Screen Scraper

Scrapes the titles and release years of the top 250 movies from IMDB and writes them to a CSV file.

Why I wrote this

For a MOOC I'm taking on Information Visualization, I wanted some IMDB data on top movies in CSV format so I could do temporal and topical analysis.

How to run it

Install Node.js if you don't already have it.

Clone this repo, cd into the project directory, and run:

npm install
node server.js

Open a browser and go to http://localhost:8001/scrape

Check your project directory. It should now contain scraped.csv with the movie data.
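For a rough idea of what the CSV step involves, here is a minimal sketch using only Node's built-in fs module. It is not the code in server.js: the function names (csvField, toCsv, writeCsv), the title,year header row, and the two hand-typed sample rows are all assumptions for illustration. The one detail worth noting is quoting, since some titles contain commas.

```javascript
const fs = require("fs");

// Quote a field only when it contains a comma, double quote, or line break,
// doubling any embedded double quotes (the usual CSV convention).
function csvField(value) {
  const text = String(value);
  return /[",\r\n]/.test(text) ? '"' + text.replace(/"/g, '""') + '"' : text;
}

// Turn an array of { title, year } objects into CSV text.
// The "title,year" header is an assumed layout, not necessarily the real one.
function toCsv(movies) {
  const lines = movies.map((m) => [m.title, m.year].map(csvField).join(","));
  return ["title,year", ...lines].join("\n") + "\n";
}

// Write the rows to a file at the given path.
function writeCsv(path, movies) {
  fs.writeFileSync(path, toCsv(movies));
}

// Hand-typed sample rows standing in for scraped results.
const sample = [
  { title: "The Godfather", year: 1972 },
  { title: "The Good, the Bad and the Ugly", year: 1966 },
];

console.log(toCsv(sample));
```

Calling writeCsv("scraped-example.csv", sample) would write the same text to a file in the current directory. The second sample title is wrapped in double quotes in the output so that its comma is not read as a column separator.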

Inspiration

Thanks to this blog post for demonstrating how to use request and cheerio for screen scraping.