k-bx/boilerpipe

Ignore FORM tags in HTMLHighlighter

GoogleCodeExporter opened this issue · 1 comments

When using HTMLHighlighter some times boilerpipe keeps some artifacts related 
coming from FORM and LABEL tags.

This can be easily prevented by addding a new ignorable element to TAG_ACTIONS 
map in HTMLHighlighter.java:

TAG_ACTIONS.put("FORM", TA_IGNORABLE_ELEMENT);


Original issue reported on code.google.com by xavi.beu...@gmail.com on 24 Mar 2012 at 6:40

  • Merged into: #44

Original comment by ckkohl79 on 25 Mar 2012 at 2:12

  • Changed state: Duplicate