Replace the HTML Parser
WilcoFiers opened this issue · 1 comments
WilcoFiers commented
The current parser can only return the last error it encounters. That error may not even be correct, since some problems cascade and cause elements to fail incorrectly. There is probably no way to fix this without rewriting the parser altogether. Only the error message is returned, so no code snippet can be extracted either.
A replacement parser should have the following features:
- Identifies all errors regarding incorrectly nested tags
- Identifies duplicate attributes
- Allows self-closing tags
- Prohibits closing tags on elements that are not permitted to have them
- Applies XML rules to pages with an XML doctype (self-closing tags must end with "/>")
- Returns a code snippet with each error
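
To make the requirements above more concrete, here is a rough sketch of a checker that collects every error instead of stopping at the last one. It is only an illustration built on Python's standard `html.parser` module, not a proposal for the actual replacement. The names `CollectingChecker`, `check_html` and `VOID_ELEMENTS` are made up for this example, the void element list is a subset, and the XML doctype rule is not covered.

```python
from html.parser import HTMLParser

# Hypothetical subset of elements that may not have a closing tag.
VOID_ELEMENTS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
                 "link", "meta", "source", "track", "wbr"}


class CollectingChecker(HTMLParser):
    """Collects all nesting and attribute errors rather than only the last."""

    def __init__(self, source):
        super().__init__()
        self.lines = source.splitlines()
        self.open_tags = []  # stack of (tag name, line number where it opened)
        self.errors = []     # one dict per error: message, line, snippet

    def _report(self, message, line):
        # The snippet is the source line the error was found on.
        snippet = self.lines[line - 1].strip() if 0 < line <= len(self.lines) else ""
        self.errors.append({"message": message, "line": line, "snippet": snippet})

    def _check_attributes(self, tag, attrs):
        seen = set()
        for name, _value in attrs:
            if name in seen:
                self._report(f"duplicate attribute '{name}' on <{tag}>", self.getpos()[0])
            seen.add(name)

    def handle_starttag(self, tag, attrs):
        self._check_attributes(tag, attrs)
        if tag not in VOID_ELEMENTS:
            self.open_tags.append((tag, self.getpos()[0]))

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags are allowed and never go on the stack.
        self._check_attributes(tag, attrs)

    def handle_endtag(self, tag):
        line = self.getpos()[0]
        if tag in VOID_ELEMENTS:
            self._report(f"closing tag </{tag}> is not permitted", line)
            return
        if tag not in [name for name, _ in self.open_tags]:
            self._report(f"closing tag </{tag}> has no matching opening tag", line)
            return
        # Unwind to the matching element, reporting everything left open on the way.
        while self.open_tags:
            open_tag, open_line = self.open_tags.pop()
            if open_tag == tag:
                break
            self._report(f"<{open_tag}> is not closed before </{tag}>", open_line)


def check_html(source):
    checker = CollectingChecker(source)
    checker.feed(source)
    checker.close()
    # Anything still on the stack was never closed.
    for tag, line in checker.open_tags:
        checker._report(f"<{tag}> is never closed", line)
    return checker.errors


if __name__ == "__main__":
    sample = '<div id="a" id="b">\n<p><b>text</p></b>\n<br/>\n</br>\n</div>'
    for error in check_html(sample):
        print(error["line"], error["message"], "|", error["snippet"])
```

Because a mismatched closing tag unwinds the stack only as far as its matching opening tag, one nesting mistake does not cascade into false errors for the elements around it.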