
A simple HTML beautifier.

Primary language: Python. License: Do What The F*ck You Want To Public License (WTFPL).

arrumator.py

It's beautifulsoup4 with four-space indentation and empty tags kept on one line. Oh, and an optional --tidy parameter.
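To make the two formatting rules concrete, here is a rough sketch of the same output style. It is not arrumator's code: it swaps beautifulsoup4 for the standard library's html.parser so it runs with no dependencies, and the names SketchBeautifier and beautify are made up for this illustration.

```python
from html import escape
from html.parser import HTMLParser

# Elements that never take a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}
# Elements whose text content must be passed through unescaped.
RAW_TEXT_TAGS = {"script", "style"}


class SketchBeautifier(HTMLParser):
    """Hypothetical helper: re-indents HTML, one node per line."""

    def __init__(self, indent="    "):  # four spaces per level
        super().__init__()
        self.indent = indent
        self.depth = 0
        self.lines = []
        self.just_opened = False  # last line is an opening tag with no content yet
        self.raw = False          # currently inside <script> or <style>

    def _emit(self, text):
        self.lines.append(self.indent * self.depth + text)
        self.just_opened = False

    def handle_starttag(self, tag, attrs):
        # Reuse the tag text exactly as it appeared in the input.
        self._emit(self.get_starttag_text())
        if tag not in VOID_TAGS:
            self.depth += 1
            self.just_opened = True
            self.raw = tag in RAW_TEXT_TAGS

    def handle_startendtag(self, tag, attrs):
        self._emit(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        self.depth = max(self.depth - 1, 0)
        self.raw = False
        if self.just_opened:
            # Empty element: keep opening and closing tag on one line.
            self.lines[-1] += "</%s>" % tag
            self.just_opened = False
        else:
            self._emit("</%s>" % tag)

    def handle_data(self, data):
        text = data.strip()
        if text:
            self._emit(text if self.raw else escape(text, quote=False))

    def handle_comment(self, data):
        self._emit("<!--%s-->" % data)

    def handle_decl(self, decl):
        self._emit("<!%s>" % decl)


def beautify(html_text):
    parser = SketchBeautifier()
    parser.feed(html_text)
    parser.close()
    return "\n".join(parser.lines)


print(beautify("<div><p>Hi</p><span></span><br></div>"))
```

The empty span stays on a single line while everything else gets its own line at four spaces per nesting level. Unlike beautifulsoup4, this sketch does not repair unbalanced markup, so it only behaves well on input that is already well formed.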

It's not on PyPI, so install it straight from the repository:

$ pip install git+https://github.com/ravishi/arrumator.git
$ arrumator.py dirtydocument.html > cleandocument.html

If you're dealing with really dirty HTML, you may want to try tidy.

$ sudo apt-get install tidy
$ arrumator.py --tidy reallydirtydocument.html

Development

If you have any tips, improvements or requests, you are encouraged to let me know.