/Scot_covid19

results of covid19 testing in Scotland

Primary LanguageHTMLThe UnlicenseUnlicense

Scot_covid19

Final update 26 April 2020. After exactly six weeks, and 120 commits to this repo, I've retired it. The data will no longer be updated.

It was only ever designed to be a stop gap for a week or so while Scottish Government started to publish the daily data contained in the static HTML page as open data.

As of Friday, SG now produce not only the XLSX files that they introduced 10 days or so ago, but now their own Github repo of CSVs. I've been helping them test the data over a couple of days.

Publication of that seems to be up to an hour after the offical HTML publication, but I know staff are doing this in their own time at weekends. The fact that it is being maintained openly and daily is great!

So, Ive advised all of those with whom I am in contact, who I know are using the data for analysis or data viz, to point their code at the new government repo.

Ian

Past updates

I've uploaded an improved scraper Get Daily Data.Ipynb. If you have Jupyter Notebooks you can download and run it for yourself. It is a scraper - so it is brittle. Any small change in the target page can break it - as has happened three times in development. I'm also now capturing NHS Scotland staff absences.

On 08 Apr 2020 the main Scottish Government page of Coronavirus information was moved to a new location. See data below.

On 02 Apr 2020__ Today the Scottish Government announced that the process for recording deaths had been changed. The data was updated tonight to reflect that.

I have added the number of patients in Intensive Care from news conferences attended by the First Minister or Chief Medical Officer.

I blogged about the need for better open data from the government in Scotland.

This primarily consists of text files (CSV) containing results of Covid-19 testing in Scotland.

Data

This now sits in a /data directory with subdirectory for raw and processed data.

Data Sources

  • The source of data from 06 March to 13 March 2020 is the Internet Wayback Machine.
  • From 14 March 2020 the data is sourced directly from the Scottish Government's webpage, and for now is manually added.
  • From 8th April 2020 the main Scottish Government Coronovirus information moved to this webpage although it appears that the page above is also still being maintained.
  • A friend supplied me with data from 24 Jan 2020 to 6th March to fill in gaps.

Contains public sector information licensed under the Open Government Licence v3.0.

Other data

I've added the Health Board codes from the NHS Scotland Open Data Store and a July 2019 poulation estimate file from NHS Scotland Open Data Store. Both are published under OGL.

Processed data

I've created a Jupyter Notebook, initial_data.ipynb which is used to calculate the 2018 populations of each Health Board, and save that as HB_Populations.csv.

Other resources

I also recommend Tom White's Github repo Covid-19-UK-Data which is broader and more code-driven for now. We may start collaborating / merging these.