# blog-scraping-tool

  • Version: 1.0

Description

A scraping tool best suited to extracting blog text from the HTML of WordPress and Blogger blogs. Support for custom platforms depends on their markup. The tool was developed as part of a bachelor's thesis at the University of Latvia, "Latvian language corpus creation from blog texts", by Mārtiņš Laizāns.
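To illustrate why support depends on markup, here is a minimal sketch of markup-based text extraction. It is not the tool's own code (the tool is written in PHP); it is a standalone Python example using only the standard library. The class names `entry-content` and `post-body` are assumptions based on what many WordPress and Blogger themes commonly use, and the names `PostTextExtractor` and `extract_post_text` are hypothetical.

```python
from html.parser import HTMLParser

# Assumed container classes; real themes vary, so these are illustrative only.
CONTENT_CLASSES = {"entry-content", "post-body"}


class PostTextExtractor(HTMLParser):
    """Collects text inside <div> elements whose class marks post content."""

    def __init__(self, content_classes=CONTENT_CLASSES):
        super().__init__(convert_charrefs=True)
        self.content_classes = set(content_classes)
        self.depth = 0    # <div> nesting depth inside a content container
        self.skip = 0     # <script>/<style> nesting depth inside content
        self.chunks = []  # collected text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if self.depth:
            # Already inside a content container: track nesting only.
            if tag == "div":
                self.depth += 1
            elif tag in ("script", "style"):
                self.skip += 1
        elif tag == "div":
            classes = (dict(attrs).get("class") or "").split()
            if self.content_classes.intersection(classes):
                self.depth = 1

    def handle_endtag(self, tag):
        if not self.depth:
            return
        if tag == "div":
            self.depth -= 1
            if self.depth == 0:
                self.skip = 0  # leaving the container resets skip state
        elif tag in ("script", "style") and self.skip:
            self.skip -= 1

    def handle_data(self, data):
        if self.depth and not self.skip:
            text = " ".join(data.split())  # collapse runs of whitespace
            if text:
                self.chunks.append(text)


def extract_post_text(html, content_classes=CONTENT_CLASSES):
    """Returns the post text found in `html`, one fragment per line."""
    parser = PostTextExtractor(content_classes)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

For a blog on a custom platform, one would pass that platform's own container class, for example `extract_post_text(page_html, {"article-text"})`, where `article-text` is a made-up class name. If the platform's markup has no distinguishing container, this approach returns nothing.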

Built using FuelPHP (http://www.fuelphp.com/), WampServer (http://www.wampserver.com/en/) and NetBeans IDE (https://netbeans.org/).