dehyphenation
There are 2 repositories under the dehyphenation topic.
pd3f/dehyphen
📜 Dehyphenation of broken text (mainly German), e.g., text extracted from a PDF
pd3f/pd3f-core
📑 Python package that uses language models to reconstruct the original continuous text from PDFs