matthewwithanm/python-markdownify

UnicodeDecodeError


I frequently get errors like UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf7 in position 14: invalid start byte.

BS4 has an option to ignore decoding errors. Could Markdownify pass that option through?

@bprager - can you provide a minimal example? For instance, print the repr() of a small piece of input that triggers this, so that the problematic bytes are included in escaped form.
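For reference, here is a rough sketch of what such a report could look like, using only the standard library. The byte string is a made-up sample (not the reporter's actual input), and it assumes the failure happens when raw bytes are decoded as UTF-8 somewhere before the HTML reaches the converter. It shows the strict decode failing, the repr() of the input, and two possible ways to decode it up front.

```python
# Hypothetical sample input: 0xf7 is the division sign in cp1252/Latin-1,
# but it is never a valid start byte in UTF-8.
raw = b"<p>10 \xf7 2 = 5</p>"

# Strict decoding reproduces the kind of error described in the report.
try:
    raw.decode("utf-8")
    strict_error = None
except UnicodeDecodeError as exc:
    strict_error = exc
    print(f"{type(exc).__name__}: {exc}")

# repr() escapes the offending byte so it survives copy and paste.
print(repr(raw))

# Option 1: keep decoding as UTF-8 but substitute U+FFFD for bad bytes.
lenient = raw.decode("utf-8", errors="replace")

# Option 2: decode with the encoding the data was probably written in
# (cp1252 is only a guess for this sample).
latin = raw.decode("cp1252")

print(lenient)
print(latin)
```

Either decoded string could then be handed to the converter as ordinary text. Whether that is an acceptable workaround depends on where the bytes are actually being decoded, which the minimal example should help pin down.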