jonaswinkler/paperless-ng

[BUG] Downloaded filenames don't support Unicode

mpldr opened this issue · 3 comments

mpldr commented

Describe the bug
When downloading a document where the Correspondent/Title contains special characters, these characters are mangled:

- 2022-10-18 Stadt Gro�enhain Meldebest�tigung.pdf
+ 2022-10-18 Stadt Großenhain Meldebestätigung.pdf

To Reproduce
Steps to reproduce the behavior:

  1. Find document in list
  2. Click on 'Download'
  3. See Filename

Expected behavior
The filename does not contain invalid symbols

Screenshots
Not applicable

Relevant information

  • Host OS of the machine running paperless: Archlinux
  • Browser Firefox Android
  • Version 1.5.0
  • Installation method: docker

paperless-ng is pretty much abandoned. Have a look at https://github.com/paperless-ngx/paperless-ngx for a maintained fork.

mpldr commented

Thanks, will do.

didn't solve it to me though...

[2023-01-14 00:44:37,557] [INFO] [paperless.management.consumer] Adding /usr/src/paperless/consume/34242536 - Einkommensteuererklrung.pdf to the task queue.

[2023-01-14 00:44:37,647] [ERROR] [paperless.handlers] Creating PaperlessTask failed: 'utf-8' codec can't encode character '\udce4' in position 30: surrogates not allowed

EDIT: worked for me as I installed the ubuntu base in german