/WebScraper

Java-based web scraper that collects results from given sources using a selector implemented for each website template. You can define proxy servers or a user agent for each selector so that it acts as a specific user. The goal is to make it easy to extend the scraper to many websites by implementing the appropriate Selector. HTML navigation is handled by JSoup. The package also includes ProxyFinder, which downloads the addresses of active proxies from defined proxy selectors, so you don't need to search for them yourself.
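As a rough illustration of the per-selector identity described above, the sketch below attaches a user agent and an optional proxy to a selector and turns them into an HTTP client and request. It is not the project's actual code: `SelectorSketch`, `SelectorConfig`, the `Selector` interface shape, `buildClient`, and `buildRequest` are all hypothetical names, and the JDK's `java.net.http` client (Java 11+) stands in for whatever fetching layer the project uses alongside JSoup.

```java
import java.net.InetSocketAddress;
import java.net.ProxySelector;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.util.List;

public class SelectorSketch {

    /** Hypothetical per-selector settings: the identity a selector presents to a site. */
    public static final class SelectorConfig {
        public final String userAgent;
        public final String proxyHost; // null means "connect directly, no proxy"
        public final int proxyPort;

        public SelectorConfig(String userAgent, String proxyHost, int proxyPort) {
            this.userAgent = userAgent;
            this.proxyHost = proxyHost;
            this.proxyPort = proxyPort;
        }
    }

    /** Hypothetical extension point: one implementation per website template. */
    public interface Selector {
        SelectorConfig config();

        /** Extracts results from a fetched page, e.g. by delegating to an HTML parser. */
        List<String> extract(String html);
    }

    /** Builds a client that routes traffic through the selector's proxy, if one is set. */
    public static HttpClient buildClient(SelectorConfig cfg) {
        HttpClient.Builder builder = HttpClient.newBuilder();
        if (cfg.proxyHost != null) {
            builder.proxy(ProxySelector.of(new InetSocketAddress(cfg.proxyHost, cfg.proxyPort)));
        }
        return builder.build();
    }

    /** Builds a GET request that carries the selector's user agent. */
    public static HttpRequest buildRequest(SelectorConfig cfg, String url) {
        return HttpRequest.newBuilder(URI.create(url))
                .header("User-Agent", cfg.userAgent)
                .GET()
                .build();
    }

    public static void main(String[] args) {
        // Placeholder values; no network call is made here.
        SelectorConfig cfg = new SelectorConfig("ExampleBot/1.0", "127.0.0.1", 8080);
        HttpRequest request = buildRequest(cfg, "https://example.com/");
        System.out.println(request.headers().firstValue("User-Agent").orElse("none"));
        System.out.println(buildClient(cfg).proxy().isPresent());
    }
}
```

Keeping the proxy and user agent on the selector rather than in global settings means each website template can be scraped under its own identity, and adding a new site only requires a new `Selector` implementation.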

Primary Language: HTML
