text-digitisation
There are 3 repositories under the text-digitisation topic.
hyeonsangjeon/computing-Korean-STT-error-rates
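A Python function package that computes the character error rate (CER) and word error rate (WER) of the scripts output by a Korean speech-to-text (STT) sentence recognizer.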
jo-valer/tesseract-ocr-enhanced
Preprocessing methods to enhance Tesseract-OCR for printed text on a difficult background, or handwritten text on lined/squared paper.
polifonia-project/textual-corpus-population
Repository containing code for downloading and digitising textual documents used as a corpus for the Polifonia Project.
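For readers unfamiliar with the CER and WER metrics named in the first entry, here is a minimal sketch of how they are commonly defined: the Levenshtein edit distance between the reference and the recognizer output, divided by the reference length. This is not code from hyeonsangjeon/computing-Korean-STT-error-rates. The function names `edit_distance`, `cer`, and `wer` are hypothetical, and the choices to count spaces as characters in CER and to split words on whitespace in WER are assumptions made for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate; spaces count as characters (an assumption)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word error rate; words are split on whitespace (an assumption)."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


if __name__ == "__main__":
    print(cer("abc", "abd"))
    print(wer("the cat sat", "the cat sit"))
```

Both rates can exceed 1.0 when the hypothesis contains many insertions, because the denominator is the reference length only.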