The data scraped is no longer relevant, and I am not interested in fixing this project. You may still have a look through it if you are interested. ARC updated its website.
python scraper to get club data from https://www.arc.unsw.edu.au/
$ git clone git@github.com:HARI-PRMD/unsw-arc-scraper.git
$ pip install -r requirements.txt
$ python scraper.py
- the scraper first gets a full list of the clubs from https://arc-discovery.linkupevents.com/ which is stored in
/data/all_club_names.json
- then it gets each club page by calling https://arc-discovery.linkupevents.com/club/
- then it gets the JSON object from the
<script>
tag of the page - finally this data is stored in
/data/all_club_data.json
for you to use however you want