ocr-text

There are 3 repositories under ocr-text topic.

  • marijnkoolen/fuzzy-search

    Fuzzy search modules for searching lists of words in low quality OCR and HTR text.

    Language:Python21301
  • ruoyuxie/noisy_parallel_data_alignment

    Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685

    Language:Python7211
  • MehulGoel1/ocr.text.search

    This allows to search text among all the image (screenshot) files in a specified folder and it returns a list of file names in which all, it found the text. It runs ocr always on just the newly added files for lesser time consumption. When any screenshots or images are removed from the folder thier corresponding text file is archived not deleted, and hence they not searched for the text.

    Language:Python3102