buriy/python-readability

Problems with thecyberwire.com

097115 opened this issue · 0 comments

Take this page, for example: https://thecyberwire.com/newsletters/policy-briefing/4/28:

  • doc.summary() returns only the main text, the first 3 paragraphs, but completely skips the SELECTED READING section.

Or, take this page: https://thecyberwire.com/newsletters/daily-briefing/11/29

  • here, on the contrary, doc.summary() returns only the SELECTED READING section, but skips the SPECIAL SECTION :)

Would be great to find some solution.