HTML is broken.
Closed this issue · 4 comments
Deleted user commented
original HTML:
<span id="language">
| Language:
<a title="Switch language to English" href="?lang=en">
<img alt="Switch language to English" src="/resources/images/en.png" class="flag">
</a>
<a title="Switch language to Deutsch" href="?lang=de">
<img alt="Switch language to Deutsch" src="/resources/images/de.png" class="flag">
</a>
<a title="Switch language to Espanol" href="?lang=es">
<img alt="Switch language to Espanol" src="/resources/images/es.png" class="flag">
</a>
</span>
translated HTML:
<span id="language">
|言語
</span>
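The two snippets above differ in their element counts: the original has three `<a>` and three `<img>` elements inside the `<span>`, and the translated output has none. A minimal sketch of a regression check for that symptom follows. The class and method names (`TagCountCheck`, `countOpeningTags`, `preservesTags`) are hypothetical and not part of this project, and the regex-based counting is only a rough approximation that ignores comments and script content.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of the project: compares how many opening
// tags of a given name appear before and after translation.
final class TagCountCheck {

    // Counts opening tags such as <a ...>, <a> or <a/>. Closing tags (</a>)
    // and longer names (<abbr>) are not counted.
    static int countOpeningTags(String html, String tagName) {
        Pattern p = Pattern.compile(
                "<" + Pattern.quote(tagName) + "[\\s>/]",
                Pattern.CASE_INSENSITIVE);
        Matcher m = p.matcher(html);
        int count = 0;
        while (m.find()) {
            count++;
        }
        return count;
    }

    // Returns true only if every listed tag occurs equally often in both inputs.
    static boolean preservesTags(String original, String translated, String... tagNames) {
        for (String tag : tagNames) {
            if (countOpeningTags(original, tag) != countOpeningTags(translated, tag)) {
                return false;
            }
        }
        return true;
    }
}
```

Applied to the snippets in this report, `preservesTags(original, translated, "a", "img")` would be expected to return `false` until the parser problem is resolved.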
Deleted user commented
Fixed.
Deleted user commented
This is not fixed yet.
Deleted user commented
We need to either find another HTML parser or fix xsoup.
Deleted user commented
xsoup does not seem to parse this HTML properly: the child elements of the `<span>` are lost in the translated output. I will replace xsoup with htmlparser.
http://search.maven.org/#artifactdetails%7Cus.codecraft%7Cxsoup%7C0.3.1%7Cjar
http://search.maven.org/#artifactdetails%7Cnu.validator%7Chtmlparser%7C1.4.3%7Cjar
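Whichever parser is used, one way to keep the child elements is to translate only the text nodes of the parsed tree and leave elements and attributes alone. The sketch below illustrates that idea under stated assumptions and is not the project's actual fix. It uses the JDK's built-in XML parser, so the input must be well-formed (note the self-closing `<img/>`), and it assumes the replacement HTML parser can hand back a comparable W3C DOM tree. The class name `TextNodeTranslator` and the `translate` function are hypothetical stand-ins.

```java
import java.io.StringReader;
import java.util.function.UnaryOperator;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical sketch: walks a DOM tree and rewrites text nodes only.
final class TextNodeTranslator {

    // Applies `translate` to every non-blank text node below `node`.
    // Elements are only traversed, never replaced, so <a> and <img> survive.
    // Attributes (title, alt) are not DOM children and are left untouched here.
    static void translateTextNodes(Node node, UnaryOperator<String> translate) {
        if (node.getNodeType() == Node.TEXT_NODE) {
            String text = node.getNodeValue();
            if (!text.trim().isEmpty()) {
                node.setNodeValue(translate.apply(text));
            }
            return;
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            translateTextNodes(child, translate);
        }
    }

    // Parses well-formed markup with the JDK XML parser. Real-world HTML
    // would need an HTML parser instead; this is for demonstration only.
    static Document parseWellFormed(String markup) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(markup)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Markup is not well-formed", e);
        }
    }
}
```

For the `<span id="language">` fragment in this issue, such a walk would change only the `| Language:` text and keep all three links and flag images. Translating the `title` and `alt` attributes would need a separate pass over each element's attributes.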