BeautifulSoup4 packaged into a command line tool. =================================================

For now this tool just parses HTML tag soup with BeautifulSoup4, and writes out the results. This can help for example to properly parse the structure of a Netscape bookmarks file, which omits many ending tags.

Installation ---------

From the Python package index (Pypi):

(sudo) pip install beautifulsoup4-slurp

or from Github:

git clone https://github.com/peterhil/slurp.git cd slurp (sudo) python setyp.py install

Usage

Show help:

slurp -h

Parse with html5lib and pretty print into stdout:

slurp -i bookmarks.html -p 'html5lib' -y

Parse with lxml and pretty print into stdout:

slurp -i bookmarks.html -p 'lxml' -y

Write pretty-printed to output to file:

slurp -y -i bookmarks.html -o bookmarks_soup.html

Pipe into slurp:

echo '<title>Slurp!</title><p><a href="https://github.com/peterhil/slurp/">Github</a>' | slurp -y

License ------

For the full copyright and license information, please view the LICENSE file that was distributed with this source code.