TRoYals/ocr-ChatGPT

Use the Baidu Form OCR API to generate full form data

Python

About the Project

Uses the Baidu Form OCR API and the ChatGPT API to extract forms from PDF files.

A sample project intended as a first step toward data analysis.

How to use?

Enter your Baidu OCR API and ChatGPT API credentials in config.ini.
Put your PDF files in the user_file folder.
Adjust the prompt in config.ini to suit your needs.
Run src/main.py. The intermediate forms are written to the temp folder and the final display forms to the output folder.
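
As a rough illustration of the setup steps above, the sketch below reads credentials and a prompt from an INI-style config and lists the PDFs waiting in user_file. The section names (`baidu_ocr`, `chatgpt`), the key names, and the helpers `load_settings` and `find_pdfs` are assumptions made for this example; check the repository's own config.ini for the actual layout.

```python
import configparser
from pathlib import Path

# Hypothetical config.ini contents; the real section and key names may differ.
SAMPLE_CONFIG = """
[baidu_ocr]
api_key = YOUR_BAIDU_API_KEY
secret_key = YOUR_BAIDU_SECRET_KEY

[chatgpt]
api_key = YOUR_OPENAI_API_KEY
prompt = Clean up this OCR table and return it as CSV.
"""


def load_settings(config_text):
    """Parse INI text and return the values the pipeline would need."""
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    return {
        "baidu_api_key": parser.get("baidu_ocr", "api_key"),
        "baidu_secret_key": parser.get("baidu_ocr", "secret_key"),
        "chatgpt_api_key": parser.get("chatgpt", "api_key"),
        "prompt": parser.get("chatgpt", "prompt"),
    }


def find_pdfs(folder="user_file"):
    """Return the PDF files in the input folder, sorted by name."""
    root = Path(folder)
    if not root.is_dir():
        return []
    return sorted(root.glob("*.pdf"))


if __name__ == "__main__":
    settings = load_settings(SAMPLE_CONFIG)
    print("Prompt:", settings["prompt"])
    for pdf in find_pdfs():
        print("Would process:", pdf)
```

To read a real file instead of the inline sample, pass `Path("config.ini").read_text(encoding="utf-8")` to `load_settings`.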

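Project status

2023-05-30 10:34 Basically meets the minimum implementation requirements. Further improvements will continue once the requirements are confirmed.

2023-06-01 15:44 Mostly complete and meets zoe's needs, but the problems with OCR recognition are still fairly obvious. Considering whether to switch to a different OCR service.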

Todo