/boilerpipe

Automatically exported from code.google.com/p/boilerpipe

Primary LanguageJava

Boilerpipe

Automatically exported from http://code.google.com/p/boilerpipe, and then maintained a bit with manual wiki-extraction with some edits.

How to build and use

To build run:

ant

To use, run:

java -jar /path/to/boilerpipe-core/dist/boilerpipe-1.2-dev.jar ./example.html out.txt

Old QuickStart

Boilerpipe is an HTML content extraction tool. Check out QuickStart.