tnef_attachement.name is not unicode

Question

tnef_attachement.name is not unicode

guettli opened this issue 10 years ago · 9 comments

I get a UnicodeError in my application code, because tnef_attachement.name is not unicode.

Could the tnef library get updated, to make tnef_attachement.name a unicode string?

In my case it looks like a latin1 string. It would improve the usability of the library if the application programmer does not need to do string decoding.

Sorry, I can't post the tnef binary, since it is from a customer.

BTW, how can you create tnef test binaries?

Answer 1 · 2017-04-05T12:15:25.000Z

fixed in fbcecac if you call long_filename() AND there is a long filename to return

Answer 2 · 2017-04-06T14:33:58.000Z

Unfortunately I don't have the matching test here to proof if the patch solves this issue.

I trust you and I think this issue can be closed.

Should I close it?

Answer 3 · 2017-04-07T12:41:05.000Z

I'm not in position to make that decision.

Answer 4 · 2017-11-03T20:43:36.000Z

Please, if you can, provide a PR.

Answer 5 · 2018-03-01T20:03:35.000Z

Note: master now strips null bytes from long_filename as well.

Answer 6 · 2018-03-02T09:48:53.000Z

The encoding of tnef attachment (long) names is a bit tricky to get right, it's not exactly simple.

See for example the discussion at roundcube/roundcubemail#5646 .

If you have good understanding of how the encodings in tnef attachments work or any links to information, that would help.

Answer 7 · 2018-03-02T10:28:15.000Z

Note: we should also consider how this is to work on Python2 vs. Python3 (with help of six?)

Answer 8 · 2018-11-29T06:52:30.000Z

This should be addressed by #31

Answer 9 · 2018-12-04T09:17:55.000Z

thank you!