Start up http-kit webserver with
boot start-server
Server will be running on localhost:3000
Run webscraper
boot run-scraper
Where the hell is all this data going to live? Do we need a hadoop cluster to churn through it all? (hope not) What ML tools should we be using? What does the interface to all of this look like?