Fix prescan bug: content attribute, second charset
openandclose opened this issue · 1 comments
openandclose commented
#data
<meta http-equiv="Content-Type" content="charsetcharset=iso-8859-2">
#encoding
iso-8859-2
A content attribute parser should discard invalid 'charset' string and search next one.
While html5lib._inputstream.EncodingParser returns: 'None'.
openandclose commented
This is the same bug as:
validator/validator#877
jsdom/html-encoding-sniffer#7 (fixed)