The goal of this repo is to purpose a python script to extract a tabular dataset from unstructured pdf from LCL.
- Be sure to have all the dependencies installed (need to create requirements.txt)
- Export all your pdf from your LCL client platform.
- Put them in the
input_data
folder - Run the python script and you will get a data.csv
It is currently under development and you may get error not yet fixed, so please do not hesitate to open issue and to contribute.
By the way it is a total unofficial repo.