anisayari/LCL_extract_data_from_pdf

The goal of this repo is to purpose a python script to extract a tabular dataset from unstructured pdf from LCL.

Python

LCL_extract_data_from_pdf

The goal of this repo is to purpose a python script to extract a tabular dataset from unstructured pdf from LCL.

How it works

Be sure to have all the dependencies installed (need to create requirements.txt)
Export all your pdf from your LCL client platform.
Put them in the input_data folder
Run the python script and you will get a data.csv

Be aware

It is currently under development and you may get error not yet fixed, so please do not hesitate to open issue and to contribute.

By the way it is a total unofficial repo.