/topic-specific-webcrawler

Distributed Topic-specific Web crawler - Group of 3 (Java, Spark, Apache Storm, Hadoop)

Primary LanguageJava

topic-specific-webcrawler

cis555

Polite multithread webcrawler utilizing b+tree db. berkerly db.

able to retrive crawled document and present it as web throw webservice.

04210859adcd424b328f23f7a4d1916

d525babb3617393ecc920926fc82c29