/LCL_extract_data_from_pdf

The goal of this repo is to purpose a python script to extract a tabular dataset from unstructured pdf from LCL.

Primary LanguagePython

LCL_extract_data_from_pdf

The goal of this repo is to purpose a python script to extract a tabular dataset from unstructured pdf from LCL.

alt text

How it works

  • Be sure to have all the dependencies installed (need to create requirements.txt)
  • Export all your pdf from your LCL client platform.
  • Put them in the input_data folder
  • Run the python script and you will get a data.csv

Be aware

It is currently under development and you may get error not yet fixed, so please do not hesitate to open issue and to contribute.

By the way it is a total unofficial repo.