USC Class Schedule Listing Scraper
This program is a little scraper for the USC Class Schedule website.
It is written as an IPython Notebook, ParseSchedule.ipynb (open it to view the code).
It uses:
- BeautifulSoup
- csvkit
- requests
I used it to scrape all USC Columbia departments for Spring 2015, which came out to 11,110 section times in total.
You can grab the resulting files schedule.csv and schedule.json. A nightly cron job updates them.
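If you just want to poke at the downloaded data, here is a minimal sketch using only the Python standard library. It assumes schedule.csv has a header row and makes no assumption about the shape of schedule.json. The helper names (load_csv_rows, load_json_records, count_by) and any column name you pass in are my own placeholders, not something taken from the notebook.

```python
import csv
import json
from collections import Counter


def load_csv_rows(path):
    """Read a CSV file into a list of dicts keyed by its header row."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def load_json_records(path):
    """Load a JSON file and return whatever structure it contains."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def count_by(rows, column):
    """Count rows per distinct value of `column`.

    `column` is whatever header name the CSV actually uses; rows that
    lack it are counted under the empty string.
    """
    return Counter(row.get(column, "") for row in rows)
```

For example, rows = load_csv_rows("schedule.csv") followed by len(rows) gives the number of rows, and rows[0].keys() shows which columns are actually present before you pick one to pass to count_by.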
Contribute
I'm sure there are bugs. Let me know what you find. Even better, fix it yourself and send in a pull request.
Also, I did not grab everything I could, just everything I thought people would want. If you want me to scrape something else, let me know.
Deployment
To deploy, I just use "File -> Download as .py" in the notebook to export the code as a plain Python script.