dankito/Readability4J

Prune aside tags

lenzls opened this issue · 0 comments

Hello,

Mozilla's Readability filters out <aside> tags before processing the HTML further, as can be seen in https://github.com/mozilla/readability/blob/master/Readability.js#L633.

Readability4J, however, does not do this: https://github.com/dankito/Readability4J/blob/master/src/main/kotlin/net/dankito/readability4j/processor/ArticleGrabber.kt#L753

I understand that this library aims to be an exact copy of Mozilla's, so I want to file this as a bug.
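
To illustrate the behaviour I would expect, here is a minimal sketch that uses jsoup directly. It assumes jsoup is on the classpath, and `pruneAsides` is a hypothetical helper name of my own, not an existing function in Readability4J or in Mozilla's code:

```kotlin
import org.jsoup.Jsoup

// Hypothetical helper (not part of Readability4J): parses the HTML and
// removes every <aside> element, including its children, from the document.
fun pruneAsides(html: String): String {
    val document = Jsoup.parse(html)
    document.select("aside").remove()
    // Return the remaining body markup as a string.
    return document.body().html()
}

fun main() {
    val html = "<article><p>Main text</p><aside>Related links</aside></article>"
    println(pruneAsides(html))
}
```

Where exactly such a step belongs inside ArticleGrabber is up to you; the sketch only shows the pruning itself.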

Regards, and thanks for the awesome software!

Simon