/webcrawler

The main objective of the project is to crawl web pages, store them, remove noise in them and finally verify the Zipf’s law distribution with the obtained noise-free content.

Primary LanguageJavaMIT LicenseMIT

Stargazers

No one’s star this repository yet.