Deprecated: Use http://navya.github.io/oars/ which now uses scrapoars as it's backend, which is a lot less hacky.
A toolchain to scrape the OARS website to generate a clean course listing. Works inside iitk only.
git clone https://github.com/rejuvyesh/poars
cd poars
npm install json2csv
make
You'll probably need to install elinks
as well.
poars.py
: Scrapes data from OARS and saves as html.
check.rb
: Checks and removes empty htmls.
dump.rb
: Dumps html files to text.
parse.py
: Parses the text files to generate a dictionary of all data.
jsn.py
: Dumps the python dictionary as json.
See in action at: http://rejuvyesh.com/poars/ or http://navya.junta.iitk.ac.in/oars/