BUGS-NYU/schedge

Scrape all available data on Schedge v1

A1Liu opened this issue · 8 comments

A1Liu commented
Scrape all available data on Schedge v1
A1Liu commented

Rescrape, to fix invalid/missing data from previous scrapes:

  • fa2020
  • ja2021
  • sp2021
  • su2021
  • fa2021
  • ja2022
  • sp2022
  • su2022
  • fa2022
  • ja2023
  • sp2023
A1Liu commented

^ Need to re-do again because we're missing section titles and timezones are fucked

for time zones, need to store local time in DB and then dynamically convert when serving the data, because I don't see another way to not delete information.

A1Liu commented

^ Need to re-do again once the bug with course titles is fixed.

A1Liu commented

Should probably use a Github workflow to scrape; logging into lightsail is a bit silly
https://docs.github.com/en/actions/managing-workflow-runs/manually-running-a-workflow

(done)

A1Liu commented

The only one that Schedge v1 has that Schedge v2 doesn't have is su2020

A1Liu commented

Need to add the --v1 flag to the scrape job, then deploy again