[The project is still under construction......]
Task | Detail | Statu |
---|---|---|
PDF->Text Data |
PyPDF2 | Processing |
Data Store | The initial decision to store structured data, such as txt | Processing |
Tokenizer Tool | Segmentation tool for text | Processing |
More... |
This is a preliminary idea, and the corresponding documentation will be created for ease of use after the tool is implemented.