This is a crawler and parser for CQPress. CQPress has parsed the biography data from https://bioguide.congress.gov/ to compile information about members of Congress.
The file crawler.ipynb is a notebook that contains a basic crawler, which uses requests to retrieve HTML files.
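
For orientation, a crawler of this kind might look roughly like the sketch below. It is not the code from crawler.ipynb: the function name `crawl`, the output directory, the file naming scheme, and the delay between requests are all assumptions made for illustration.

```python
from pathlib import Path
import time

import requests


def crawl(urls, out_dir="html", session=None, delay=1.0):
    """Download each URL and save the response body as an .html file.

    All names and defaults here are hypothetical; adjust them to match
    the actual notebook.
    """
    session = session or requests.Session()
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)

    saved = []
    for i, url in enumerate(urls):
        response = session.get(url, timeout=30)
        response.raise_for_status()  # stop on HTTP errors instead of saving error pages
        target = out_path / f"page_{i:05d}.html"
        target.write_text(response.text, encoding="utf-8")
        saved.append(target)
        time.sleep(delay)  # pause between requests to be polite to the server
    return saved
```

Passing in a session makes it possible to reuse connections across requests and to substitute a stub when testing without network access.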
The file parse_people.ipynb is a notebook where I combine, filter, and parse the people for critical characteristics.
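
As a rough illustration of the combine, filter, and parse steps, the sketch below reads a set of HTML pages, pulls one field out of each, and drops pages where that field is missing. It is not the code from parse_people.ipynb: using the page `<title>` as the extracted field is purely an assumption, as are the names `TitleParser` and `parse_people` and the record layout.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collects the text inside the first <title> element (hypothetical field)."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def parse_people(pages):
    """Combine pages into one list of records.

    pages: dict mapping a source name (e.g. a file name) to its HTML text.
    """
    records = []
    for source, html in pages.items():
        parser = TitleParser()
        parser.feed(html)
        parser.close()
        title = parser.title.strip()
        if title:  # filter: skip pages with no usable title
            records.append({"source": source, "title": title})
    return records
```

The real notebook presumably extracts more specific fields; the same structure applies once the relevant tags or attributes in the saved pages are known.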