jonaswinkler/paperless-ng

[BUG] Scan produces words with spaces between letters

tpre opened this issue · 1 comment

tpre commented

Describe the bug
A scan produces words with spaces between the letters, for example: K o n t o a u s z u g v o m 3 0 . 0 8 . 2 0 1 9
This makes paperless-ng absolutely unusable :-(

To Reproduce

  1. Install the Docker version of paperless-ng
  2. Add any PDF document

Expected behavior
No additional spaces between the letters of words.

Screenshots
Part of the PDF:
[image: grafik]

Relevant information

  • Latest Docker version
  • Debian 10
  • Installation method: Docker
  • No configuration changes made

tpre commented

I did not know that a PDF with a text layer is not touched by OCRmyPDF. But it is really curious. I scanned all my documents with ScanSnap to searchable PDFs. When I open them on OS X and copy text, everything is fine. When I open the same PDF on Windows, there are additional spaces after each letter. So why the hell is that? It is hard to find any information on the internet.
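
For anyone who wants to check which of their PDFs are affected, here is a minimal sketch of a heuristic. It assumes the existing text layer has already been extracted to a plain string (for example with a tool such as pdftotext). The function name and the threshold values are hypothetical and are not part of paperless-ng or OCRmyPDF.

```python
def looks_letter_spaced(text: str, threshold: float = 0.8, min_tokens: int = 5) -> bool:
    """Heuristic: return True if most whitespace-separated tokens are single characters.

    threshold and min_tokens are illustrative defaults, not tuned values.
    """
    tokens = text.split()
    # Very short strings do not give enough evidence either way.
    if len(tokens) < min_tokens:
        return False
    single_char_tokens = sum(1 for token in tokens if len(token) == 1)
    return single_char_tokens / len(tokens) >= threshold


if __name__ == "__main__":
    broken = "K o n t o a u s z u g v o m 3 0 . 0 8 . 2 0 1 9"
    intact = "Kontoauszug vom 30.08.2019"
    print(looks_letter_spaced(broken))
    print(looks_letter_spaced(intact))
```

Documents flagged this way would be candidates for re-running OCR instead of keeping the scanner's text layer. The heuristic can misfire on text that legitimately consists of many single characters, so treat the result as a hint only.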