mrdbscan_experimental

An implementation of DBSCAN running on top of Apache Spark.

Primary language: Scala. License: Apache-2.0.

DO NOT USE THIS - PURELY FOR dDBGSCAN COMPARISON TESTING

DBSCAN on Spark

This is a fork of irvingc/dbscan-on-spark, kept only for benchmarking against other implementations. DO NOT USE THIS.

Overview

This is an implementation of the DBSCAN clustering algorithm on top of Apache Spark. It is loosely based on the paper by He, Yaobin, et al., "MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data".
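For readers unfamiliar with the algorithm, the following is a minimal single-machine sketch of plain DBSCAN in Scala. It is not this repository's code and does not use Spark; the names `DbscanSketch`, `cluster`, `eps`, and `minPoints` are assumptions chosen for illustration. A distributed variant such as MR-DBSCAN would, roughly speaking, run a step like this on each data partition and then merge clusters that meet at partition borders.

```scala
import scala.collection.mutable

// Illustrative sketch only: hypothetical names, brute-force neighbor search.
object DbscanSketch {
  type Point = (Double, Double)

  val Noise = 0               // label for points that belong to no cluster
  private val Unvisited = -1  // internal marker for points not yet processed

  private def dist(a: Point, b: Point): Double =
    math.hypot(a._1 - b._1, a._2 - b._2)

  // Indices of all points within eps of points(i), including i itself.
  private def regionQuery(points: IndexedSeq[Point], i: Int, eps: Double): Seq[Int] =
    points.indices.filter(j => dist(points(i), points(j)) <= eps)

  /** Returns one label per point: 0 for noise, 1..k for clusters. */
  def cluster(points: IndexedSeq[Point], eps: Double, minPoints: Int): Array[Int] = {
    val labels = Array.fill(points.length)(Unvisited)
    var clusterId = 0
    for (i <- points.indices) {
      if (labels(i) == Unvisited) {
        val neighbors = regionQuery(points, i, eps)
        if (neighbors.size < minPoints) {
          labels(i) = Noise // may later be relabeled as a border point
        } else {
          // points(i) is a core point: start a new cluster and expand it.
          clusterId += 1
          labels(i) = clusterId
          val queue = mutable.Queue.empty[Int]
          queue ++= neighbors
          while (queue.nonEmpty) {
            val j = queue.dequeue()
            if (labels(j) == Noise) {
              labels(j) = clusterId // border point reached from a core point
            } else if (labels(j) == Unvisited) {
              labels(j) = clusterId
              val jNeighbors = regionQuery(points, j, eps)
              // Only core points extend the cluster further.
              if (jNeighbors.size >= minPoints) queue ++= jNeighbors
            }
          }
        }
      }
    }
    labels
  }
}
```

Calling `DbscanSketch.cluster(points, eps, minPoints)` returns an array of labels aligned with the input order. The brute-force neighbor search is quadratic in the number of points, which is one reason partitioned approaches are used for large data sets.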

License

DBSCAN on Spark is available under the Apache 2.0 license. See the LICENSE file for details.

Credits

DBSCAN on Spark is maintained by Irving Cordova (irving@irvingc.com). This fork was slightly modified for benchmarking by Daniel Marcous (dmarcous@gmail.com).