cmccambridge/ocrmypdf-auto

Feature: Experiment with building tesseract v4 on Alpine Linux

cmccambridge opened this issue · 2 comments

I'm not sure how much size could be saved by switching to Alpine, given how many other packages get pulled in to satisfy ocrmypdf's dependencies and their own dependencies. The biggest obstacle up front is that the only tesseract-ocr package available for Alpine seems to be v3.05, which performs considerably worse than the not-yet-released v4 code.

This has now been done, at least in Alpine edge, following the official release of tesseract 4.0!