ocr-text
There are 3 repositories under the ocr-text topic.
marijnkoolen/fuzzy-search
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
ruoyuxie/noisy_parallel_data_alignment
Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685
MehulGoel1/ocr.text.search
Searches for text across all image (screenshot) files in a specified folder and returns the names of the files in which the text was found. To save time, OCR runs only on newly added files. When a screenshot or image is removed from the folder, its corresponding text file is archived rather than deleted, so it is no longer searched.
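The incremental behaviour described for ocr.text.search could be sketched roughly as follows. This is not the repository's actual code: the `.ocr_cache` directory name, the `sync_cache` and `search` functions, and the `ocr` callable (standing in for whatever OCR engine is used) are all assumptions made for illustration.

```python
from pathlib import Path
from typing import Callable, List

# Hypothetical set of image extensions to consider.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def sync_cache(folder: Path, ocr: Callable[[Path], str]) -> None:
    """Bring the text cache in line with the images currently in `folder`."""
    cache = folder / ".ocr_cache"      # assumed cache location
    archive = cache / "archive"        # text of removed images is moved here
    archive.mkdir(parents=True, exist_ok=True)

    images = {
        p.name: p
        for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    }

    # Run OCR only on images that have no cached text file yet.
    for name, path in images.items():
        txt = cache / (name + ".txt")
        if not txt.exists():
            txt.write_text(ocr(path), encoding="utf-8")

    # Archive (do not delete) text files whose image no longer exists.
    for txt in list(cache.glob("*.txt")):
        if txt.name[: -len(".txt")] not in images:
            txt.replace(archive / txt.name)


def search(folder: Path, query: str, ocr: Callable[[Path], str]) -> List[str]:
    """Return the names of images whose cached text contains `query`."""
    sync_cache(folder, ocr)
    cache = folder / ".ocr_cache"
    needle = query.lower()
    # The glob is non-recursive, so archived text files are never searched.
    return sorted(
        txt.name[: -len(".txt")]
        for txt in cache.glob("*.txt")
        if needle in txt.read_text(encoding="utf-8").lower()
    )
```

Passing the OCR engine in as a callable keeps the sketch independent of any particular OCR library; in practice it would wrap a real engine's image-to-text call.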