/pdf-extraction-and-reporting

Extract relevant data from PDF document and use it for reporting or prediction. PDFminer / Cermine / GROBID

Primary LanguageJupyter Notebook

1. PDFminer extraction and reporting

Extract relevant data from PDF document and use it for reporting or prediction. PDFminer / Cermine / GROBID

Crawl web to extract relevant data from PDF report (USDA livestock, poultry & grain market news report, sample as below).
Download the report.
Extract the highlighted red infomation - past week Ethanol price ($/gal) in Iowa city.
Extract data for past multiple weeks and use it for future Ethanol price ($/gal) prediction in Iowa city.

For latest report, visit: https://www.ams.usda.gov/mnreports/lswagenergy.pdf.

pdfminer-online-report-sample