A toolkit for clustering web pages based on various similarity measures.
Primary LanguageJavaApache License 2.0Apache-2.0