dsfsi/covid19za

[DATA] Blip in the data

Diederikjh opened this issue · 6 comments

It looks like there is a bit of a blip in the test data.

See attached screenshot

Screenshot_20211124-063022

Thanks @Diederikjh will check

This is a pretty important fix that needs to be done. It seems Google is using the data from this repo- and with this new variant and all the news around it, when people search for "South Africa Covid" this graph with the blip shows up now. It looks very scary, as if this new variant has created a sudden crazy surge, and its likely having real repercussions.

It appears as if there were some historic revisions to the data, artificially raising the number of positive cases by 18,000 from 23-Nov onwards. The number here: https://sacoronavirus.co.za/2021/11/23/update-on-covid-19-tuesday-23-november-2021/ reports 2.948m cumulative cases, where as the previous day it was 2.930m. See: https://sacoronavirus.co.za/2021/11/22/update-on-covid-19-monday-22-november-2021/
However, the "new cases" number does not reflect the 18k increase, it shows less than a 1000.... for the 23rd.

Have been using this data for my website, what is the best way to handle revisions such as this for the graphical display of accurate daily new cases? Would it not be beneficial to add to the data set the officially reported daily new confirmed cases in addition to the cumulative total confirmed cases?

@thomaslane Thanks so much for your concern. Its just that time of the year in teaching and exams so I have not had the time to give full attention to make a suggestion. Solid suggestions are welcome and we can involve some of the very active contributors such as @krokkie and @lrossouw

Another idea: I've seen Time magazine's covid newsletter including backdated adjustments in the total infection number, but not the daily infection rate number. They did have an asterisk next to that row in the table for a few weeks,to explain the anomaly.