Pinned Repositories
Image-Seek
A port of ImgSeek to Perl
LCS-BV
Longest Common Subsequence implemented with Bit-Vectors
Lingua-Stem-Cistem
CISTEM Stemmer for German
Lingua-YI-Romanize
Transliterate Yiddish from Hebrew to Latin script
ocr-deu-bio-testfiles
German language (nature, biology) ground truth
ocr-eng-bio-testfiles
OCR English (Bio, Natur) ground truth and testfiles
ocr-gt-AustrianNewspapers-scripts
Scripts for AustrianNewspapers
Set-Similarity
Set::Similarity - Similarity measures for sets
Text-Guess-Language
Guess langauge from Text using top 1000 words
Text-Levenshtein-BV
Levenshtein using bit vectors
wollmers's Repositories
wollmers/Text-Levenshtein-BV
Levenshtein using bit vectors
wollmers/ocr-measures
scripts reporting scores and statistics
wollmers/Text-Levenshtein-BVXS
Text::Levenshtein::BVXS - fast implementation using bit vectors
wollmers/ocr-deu-bio-testfiles
German language (nature, biology) ground truth
wollmers/AustrianNewspapers
NewsEye / READ OCR training dataset from Austrian Newspapers
wollmers/Cor
Corinna - Bring Modern OO to the Core of Perl
wollmers/Denoising-Diffusion-Probabilistic-Models-with-MNIST
This notebook is based on the paper Denoising Diffusion Probabilistic Models by Jonathan Ho, Ajay Jain and Pieter Abbeel. The porpuse of this notebook is to understand the basic idea of the paper.
wollmers/Dewarping-Dataset-Annotations
wollmers/fancy-memset
small, fast memset based on microsoft's design
wollmers/github-workflows
wollmers/guacamole
Guacamole is a parser toolkit for Standard Perl. It provides fully static BNF-based parsing capability to a reasonable subset of Perl.
wollmers/hocrmod
Try to find regions missed by Tesseract.
wollmers/ImageInpaintingChallenge2022
wollmers/Levenshtein-Simple
Levenshtein algorithm in the simple or "naive" implementation as a reference
wollmers/limboole
Fork of the Limboole SAT solver frontend from http://fmv.jku.at/limboole/ modified to be executable using WebAssembly on the web.
wollmers/Lingua-DE-Fathom
Measure readability of German text
wollmers/Lingua-DE-Syllable
Count the number of syllables in German words.
wollmers/LiTeX
Live Text Command Line Tool
wollmers/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
wollmers/ocr-bbox-gt
Ground Truth for Bounding Boxes
wollmers/ocr-gt-tools-mojo
OCR GT tools implemented with Mojolicious
wollmers/OpenCV-Document-Scanner
An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholding.
wollmers/perlcldr
Perl module to use the Common Local Data Repository from the Unicode Consortium
wollmers/py-lcs-bv
LCS using Bitvectors in Python
wollmers/Python
All Algorithms implemented in Python
wollmers/rapidfuzz-cpp
Rapid fuzzy string matching in C++ using the Levenshtein Distance
wollmers/sudoku-sat
wollmers/Text-Levenshtein-Uni
Text-Levenshtein-Uni - calculate Levenshtein distance for Unicode (UTF-8 or U32) strings
wollmers/try-github-actions
wollmers/utf8-bench
utf8-bench - UTF-8 Benchmarks