averagehuman/python-html2rest

Links are broken

All links come out as this without any references:

`Contact us to learn more`_
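
For context, reStructuredText resolves a named reference such as the one above only if a matching target line (`.. _Contact us to learn more: <url>`) appears somewhere in the document. The following is a minimal sketch of what output with targets could look like, using only the standard library's `html.parser`. It is not html2rest's code; the names `LinkToRst` and `html_links_to_rst` and the example URL are hypothetical, and it assumes link labels are unique and contain no colons or backticks.

```python
from html.parser import HTMLParser


class LinkToRst(HTMLParser):
    """Hypothetical helper: turn <a href> tags into RST references plus targets."""

    def __init__(self):
        super().__init__()
        self.out = []        # pieces of the converted body text
        self.targets = []    # (label, url) pairs emitted after the body
        self._href = None    # href of the <a> currently open, if any
        self._label = []     # text collected inside the current <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._label = []

    def handle_data(self, data):
        # Text inside a link becomes the label; everything else is body text.
        if self._href is not None:
            self._label.append(data)
        else:
            self.out.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse internal whitespace so the label matches its target.
            label = " ".join("".join(self._label).split())
            self.out.append("`%s`_" % label)
            self.targets.append((label, self._href))
            self._href = None

    def result(self):
        body = "".join(self.out).strip()
        refs = "\n".join(".. _%s: %s" % (label, url)
                         for label, url in self.targets)
        return body + ("\n\n" + refs if refs else "")


def html_links_to_rst(html):
    """Convert the links in an HTML fragment; all other tags are dropped."""
    parser = LinkToRst()
    parser.feed(html)
    parser.close()
    return parser.result()


if __name__ == "__main__":
    # Example URL is made up for illustration.
    print(html_links_to_rst(
        '<p>Please <a href="https://example.com/contact">'
        'Contact us to learn more</a>.</p>'
    ))
```

A label containing a colon would need backquotes in the target line, and duplicate labels pointing at different URLs would need anonymous references (`__`) instead; the sketch handles neither.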

This project looks abandoned.
I now use Pandoc instead:

$ pandoc --from=html --to=rst < README.html > README.rst

Hope this helps.

@AurelienLourot: Thank you for the suggestion, but that's not the direction I'm looking to go.

The goal is to avoid relying on third-party applications like Pandoc, so that the code is significantly more portable.

I also found that Pandoc produced some sloppy reStructuredText formatting, so I ended up writing my own converter that serves my needs and relies on no external binaries. It just uses lxml, because I like lxml.objectify much more than the built-in XML DOM utilities and it's significantly faster.