pytorchOCR: A C++ repository from fxwfzsxyq

An OCR library based on PyTorch.

The detection models were trained only on the public ICDAR2015 text detection dataset. Results:

Model	Backbone	Precision	Recall	Hmean	Download
DB	ResNet50_7*7	85.88%	79.10%	82.35%	Download link (code: fxw6)
DB	ResNet50_3*3	86.51%	80.59%	83.44%	Download link (code: fxw6)
DB	MobileNetV3	82.89%	75.83%	79.20%	Download link (code: fxw6)
SAST	ResNet50_7*7	85.72%	78.38%	81.89%	Download link (code: fxw6)
SAST	ResNet50_3*3	86.67%	76.74%	81.40%	Download link (code: fxw6)
PSE	ResNet50_7*7	0%	0%	0%	Download link (code: fxw6)
PSE	ResNet50_3*3	0%	0%	0%	Download link (code: fxw6)
PAN	ResNet18_7*7	81.80%	77.08%	79.37%	Download link (code: fxw6)
PAN	ResNet18_3*3	83.78%	75.15%	79.23%	Download link (code: fxw6)
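
The README does not define Hmean. The rows above are consistent with the usual ICDAR convention, where Hmean is the harmonic mean of precision and recall. A minimal sketch under that assumption (the function name `hmean` is illustrative and not part of this repository):

```python
def hmean(precision, recall):
    """Harmonic mean of precision and recall, both given in percent."""
    if precision + recall == 0:
        # Avoid division by zero when both metrics are 0.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# DB with ResNet50_7*7, values copied from the table above.
print(round(hmean(85.88, 79.10), 2))  # 82.35
```

Because the table lists precision and recall already rounded to two decimals, recomputing Hmean from them can differ from the listed value by about 0.01 on some rows.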

image
│   .jpg
│   .jpg
│   ...

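A train_list.txt file is required. Each line has the format: absolute image path + \t + label, that is, the path and the label separated by a tab character. See the examples in data/example in this project for reference. If you want to run validation during training, prepare a test_list.txt in the same format.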
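
The list format described above can be produced with a few lines of Python. This is only a sketch: the helper name `write_list_file`, the UTF-8 encoding, and the sample paths and labels are assumptions for illustration, and the files in data/example remain the authoritative reference for the format.

```python
from pathlib import Path


def write_list_file(samples, list_path):
    """Write one 'absolute_image_path<TAB>label' line per sample.

    samples: iterable of (image_path, label) pairs.
    Returns the list of lines written (without trailing newlines).
    """
    lines = []
    for image_path, label in samples:
        # A tab or newline inside the label would break the one-line-per-sample format.
        if "\t" in label or "\n" in label:
            raise ValueError(f"label contains a tab or newline: {label!r}")
        # resolve() turns a relative path into an absolute one.
        abs_path = Path(image_path).resolve()
        lines.append(f"{abs_path}\t{label}")
    # Encoding is an assumption; adjust it if the training script expects another.
    Path(list_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return lines


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the repository.
    demo = [("image/0001.jpg", "hello"), ("image/0002.jpg", "world")]
    for line in write_list_file(demo, "train_list.txt"):
        print(line)
```

Calling the same helper with a held-out set of samples and the path test_list.txt gives a validation list in the same format.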

python3 ./tools/rec_train.py

python3 ./tools/rec_infer.py