/parseclassschedule

A Scraper for USC's Class Schedule Listing website

USC Class Schedule Listing Scraper

This program is a little scraper for the USC Class Schedule website.

It is written as a Python Notebook called ParseSchedule.ipynb (click that to view the code).

It uses:

  • BeautifulSoup
  • csvkit
  • requests

I used it to scrape all USC Columbia departments for Spring 2015, which came out to 11,110 section times in total.

You can grab the resulting files schedule.csv and schedule.json. A nightly cron job updates them.

Contribute

I'm sure there are bugs. Let me know what you find. Even better, fix it yourself and send in a pull request.

Also, I did not grab everything I could, just everything I thought people would want. If you want me to scrape something else, let me know.

Deployment

To deploy I just have to "File->Download as .py"