Automatically exported from http://code.google.com/p/boilerpipe, and then maintained a bit with manual wiki-extraction with some edits.
To build run:
ant
To use, run:
java -jar /path/to/boilerpipe-core/dist/boilerpipe-1.2-dev.jar ./example.html out.txt
Boilerpipe is an HTML content extraction tool. Check out QuickStart.