/TreeHTMLParse

A web page screen scraper, based on Java and XPath

Primary LanguageJava

TreeHTMLParse

This program created using Java and Netbeans framework. To compile, "cd" to the source directory and run the command: ant jar

To run the program, simply "cd" to the directory where the file TreeHTMLParse.jar located (or in the ./dist directory within the source code) and execure the command: java -jar TreeHTMLParse.jar

dependecy: HTML2XHTML http://www.it.uc3m.es/jaf/html2xhtml/