/scraping_data_from_pdf

Code repository sample to demonstrate how to scrape table data from PDF file.

Primary LanguageJupyter Notebook

Scraping Data from PDF

Code repository sample to demonstrate how to scrape table data from PDF file. Please also check out my publication in Geek Culture's Medium (Scraping Data from PDF) for more detail.

  • scraping_data_from_pdf.ipynb <-- From data source "Arrangements for Compulsory Testing in respect of Buildings Resided by COVID-19 Cases with the N501Y/L452R variants in accordance with the Compulsory Testing Notice" at coronavirus.gov.hk.

  • scraping_data_from_pdf_2.ipynb <-- From data source "Details of Compulsory Testing Notice (G.N. (E.) 192 of 2022) - Places Visited by Tested Preliminarily Positive Cases/ Tested Positive Cases" at www.chp.gov.hk.

  • scraping_data_from_pdf_2_1.ipynb <-- From data source "Details of Compulsory Testing Notice (G.N. (E.) 192 of 2022) - Places Visited by Tested Preliminarily Positive Cases/ Tested Positive Cases" of latest (Today) at www.chp.gov.hk.