buriy/python-readability

summary removing non breaking space

dosomder opened this issue · 1 comments

The summary method removes non breaking space ( ) instead of inserting a simple space or keeping it.

Example:

<div id="t3_1" class="t s1_1">Dann&nbsp;haben&nbsp;wir&nbsp;ein&nbsp;unschlagbares&nbsp;Angebot&nbsp;für&nbsp;Sie!</div>

gets to

DannhabenwireinunschlagbaresAngebotfürSie!

Full html file is attached as text file
nbs_example.txt

buriy commented

Thanks, that's an important issue, I'll check.