dankito/Readability4J

Long post from beatportal only returns text from the middle

Pranoy1c opened this issue · 1 comments

I am testing using the HTML from this page:

https://www.beatportal.com/features/beatports-definitive-guide-to-techno/

It only seems to return the output from the middle of the page:

From:
Mark Ernestus, founder of record shop Hard Wax, was also instrumental...

till:

....a glimpse into the future via the new techno sound

I realize the site's HTML is pretty annoying but is there a way to fix this? Safari's Reader mode works perfectly on this. Firefox doesn't though (gives same output as your library).

Then i don't think there will be a workaround as this library is based on the code of Firefox's reader mode. So what in Firefox's reader mode doesn't work, will also not work here.