Mini-study aiming to measure the OCR quality of Tesseract for the already existing text blocks in the newspaper dataset. The goal is to see if Tesseract is better than the current OCR solution, and comparisons are made using a manually annotated gold standard.