/poars

Saner oars

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Saner OARS [DEPRECATED]

Deprecated: Use http://navya.github.io/oars/ which now uses scrapoars as it's backend, which is a lot less hacky.

What's this?

A toolchain to scrape the OARS website to generate a clean course listing. Works inside iitk only.

Usage

git clone https://github.com/rejuvyesh/poars
cd poars
npm install json2csv
make

You'll probably need to install elinks as well.

About

poars.py: Scrapes data from OARS and saves as html.

check.rb: Checks and removes empty htmls.

dump.rb : Dumps html files to text.

parse.py: Parses the text files to generate a dictionary of all data.

jsn.py : Dumps the python dictionary as json.

See in action at: http://rejuvyesh.com/poars/ or http://navya.junta.iitk.ac.in/oars/