/html2text

Convert HTML to Markdown-formatted text.

Primary LanguagePython

html2text is a Python script that converts a page of HTML into clean, easy-to-read plain ASCII text. Better yet, that ASCII also happens to be valid Markdown (a text-to-HTML format).

Usage: python html2text.py [(filename|url) [encoding]]

Originally written by Aaron Swartz. This code is distributed under the GPLv3.