scrape_solus
is a python utility to scrape all details from from the Queen's
course enrollment site.
git clone https://github.com/austinoboyle/scrape_solus
and cd into the folder- (Ideal, but not necessary) create a virtual env
pip install -e .
to install the project in development mode. This will allow you to easily make changes on the fly.- Set your SOLUS_USER and SOLUS_PASS env variables
- You should now have the
scrapesolus
command available. Runscrapesolus --help
to see available commands.
Usage: scrapesolus [OPTIONS]
Options:
- -t, --scrape_type alpha|interval (default=alpha). alpha: each job scrapes a letter. interval: each job scrapes every Nth course.
- -n, --num_workers INTEGER (default=8) number of selenium instances to run in parallel
- -o, --output_dir PATH Output directory for data dump
- -d, --deep BOOLEAN Do you want Section Data?
- -h, --headless BOOLEAN (default: True). Set to False for debugging.
- -l, --letter TEXT Scrape all courses that start with this letter
- -c, --course_code TEXT Scrape a specific course code
- --help Show this message and exit.
scrapesolus -c "MATH 281"
scrapesolus -h False -c "MATH 281"
scrapesolus -c "MATH 281" -d False
scrapesolus -l A
scrapesolus
scrapesolus -n 2