jsfenfen/whatwordwhere
Tooling to extract data from scanned paper forms that have been OCR-ed by Tesseract into the hOCR format.
HTML
Issues
sync django model names and non-django parts
#6 opened by jsfenfen
clean namespaces off some xml hocr docs
#3 opened by jsfenfen
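
As a rough illustration of what issue #3 is asking for, the sketch below strips XML namespaces from a parsed hOCR document so that elements can be found by their plain tag names. It uses only Python's standard `xml.etree.ElementTree`; the helper name `strip_namespaces` and the sample markup are assumptions made for this example and are not taken from the repository's code.

```python
import xml.etree.ElementTree as ET


def strip_namespaces(root):
    """Remove '{namespace}' prefixes from tags and attribute names, in place.

    Hypothetical helper for illustration; not part of whatwordwhere.
    """
    for el in root.iter():
        # Comments and processing instructions have non-string tags; skip them.
        if isinstance(el.tag, str) and el.tag.startswith("{"):
            el.tag = el.tag.split("}", 1)[1]
        # Copy the keys first, because the attribute dict is modified in the loop.
        for key in list(el.attrib):
            if key.startswith("{"):
                el.attrib[key.split("}", 1)[1]] = el.attrib.pop(key)
    return root


# Made-up sample: a tiny XHTML-namespaced hOCR fragment with one word box.
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml"><body>
<span class="ocrx_word" title="bbox 10 20 110 45">Form</span>
</body></html>"""

root = strip_namespaces(ET.fromstring(SAMPLE))

# After stripping, plain tag names such as "span" match directly.
words = [
    (w.text, w.get("title"))
    for w in root.iter("span")
    if w.get("class") == "ocrx_word"
]
print(words)
```

Without the stripping step, `root.iter("span")` would find nothing in this sample, because ElementTree stores the tag as `{http://www.w3.org/1999/xhtml}span`.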