Data on the Mind 2017, Collecting Data from the Web

This repo contains webscraping and API materials for the 2017 summer workshop Data on the Mind.

Required Python libraries:

  • bs4
  • lxml or html5lib
  • selenium
  • pandas

Much of these materials were adapted from materials by Rochelle Terman and UC Berkeley's D-Lab.