This repo contains a walkthrough notebook comparing "old school" Optical Character Recognition (ocr) models (like tesseract) to modern transformer / visual-language-model approaches (like kosmos).
neonwatty/ocr_model_comparisons
Compare and contrast detection and transformer based approaches to ocr.
Jupyter Notebook