ageitgey/node-unfluff

Doesn't seem to work for sites that use <div> tags instead of <p>

iannshan opened this issue · 2 comments

I tried this with a CNN.com article and it didn't work because they don't use paragraphs. Any suggestions for a work-around?

What specific article?

This is the one I tried: http://cnn.com/2016/11/01/politics/hillary-clinton-2016-campaign/index.html

The output was just the first paragraph of the article, which actually is in a p tag unlike the rest of the article.

I tried a number of other sites including Medium, NBC News, and a few random blogs and they all worked great. When I inspected the article on CNN though I saw their use of div tags and figured that could be the problem.