WebCrawler

It finds images on a web page and saves them to local storage.

Primary language: Java. License: MIT.
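As a rough illustration of finding images on a page and saving them locally, here is a minimal sketch. It is not the code of SimpleWebCrawler.java: the class name `ImageLinkSketch`, its methods `imageUrls` and `save`, and the regex-based matching are assumptions made for this example. A real crawler would more likely use a proper HTML parser.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of this repository.
final class ImageLinkSketch {
    // Matches a quoted src attribute inside an <img ...> tag.
    // Simplification: unquoted attributes are ignored, and attributes
    // such as data-src are matched as well.
    private static final Pattern IMG_SRC = Pattern.compile(
            "<img\\b[^>]*?\\bsrc\\s*=\\s*([\"'])(.*?)\\1",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the absolute URLs of all images referenced by the page,
    // in order of first appearance and without duplicates.
    static List<String> imageUrls(String pageUrl, String html) {
        URI base = URI.create(pageUrl);
        Set<String> urls = new LinkedHashSet<>();
        Matcher m = IMG_SRC.matcher(html);
        while (m.find()) {
            // resolve() turns relative links into absolute ones;
            // it throws IllegalArgumentException on malformed links.
            urls.add(base.resolve(m.group(2).trim()).toString());
        }
        return new ArrayList<>(urls);
    }

    // Downloads one image into dir and returns the path of the saved file.
    static Path save(String imageUrl, Path dir) throws IOException {
        Files.createDirectories(dir);
        URI uri = URI.create(imageUrl);
        String path = uri.getPath() == null ? "" : uri.getPath();
        String name = path.substring(path.lastIndexOf('/') + 1);
        if (name.isEmpty()) {
            name = "image"; // fallback when the URL has no file name
        }
        Path target = dir.resolve(name);
        try (InputStream in = uri.toURL().openStream()) {
            Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
        }
        return target;
    }
}
```

Calling `ImageLinkSketch.imageUrls(pageUrl, html)` and passing each result to `ImageLinkSketch.save(url, dir)` would cover the find-and-save cycle for a single page. Images that share a file name overwrite each other in this sketch.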

Folder base

This folder contains files for testing the main program.

Folder crawler

This folder contains:

  1. SimpleWebCrawler.java -- main class that parses an HTML page and downloads its images
  2. WebCrawlerTest.java -- test class for SimpleWebCrawler
  3. WebCrawlerBossTest.java -- test class for SimpleWebCrawler
  4. Image.java -- class holding information about an image (url, data, pages)
  5. Page.java -- class holding information about a page (url, title, links, backlinks, images)
  6. Auxiliary files
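The fields listed for Image.java and Page.java suggest a small two-way data model. The sketch below shows one possible shape for it. The class names `ImageSketch` and `PageSketch`, the field types, and the helper methods `addLink` and `addImage` are assumptions made for illustration, and the actual classes in this folder may differ.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical counterpart of Image.java: url, data, pages.
final class ImageSketch {
    final String url;
    byte[] data = new byte[0];                            // downloaded bytes (assumed type)
    final List<PageSketch> pages = new ArrayList<>();     // pages that show this image

    ImageSketch(String url) {
        this.url = url;
    }
}

// Hypothetical counterpart of Page.java: url, title, links, backlinks, images.
final class PageSketch {
    final String url;
    final String title;
    final List<PageSketch> links = new ArrayList<>();     // pages this page links to
    final List<PageSketch> backlinks = new ArrayList<>(); // pages that link to this page
    final List<ImageSketch> images = new ArrayList<>();   // images found on this page

    PageSketch(String url, String title) {
        this.url = url;
        this.title = title;
    }

    // Records a link and the matching backlink on the target page.
    void addLink(PageSketch target) {
        links.add(target);
        target.backlinks.add(this);
    }

    // Records an image on this page and this page on the image.
    void addImage(ImageSketch image) {
        images.add(image);
        image.pages.add(this);
    }
}
```

Updating both sides in a single method keeps links and backlinks consistent, as well as images and pages.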

base_cache.zip

This zip file contains the pages used by WebCrawlerTest.

kgeorgiy_cache.zip

This zip file contains the pages used by WebCrawlerBossTest.