This is a fork of irvingc/dbscan-on-spark, modified for benchmarking against other implementations. DO NOT USE THIS.
This is an implementation of the DBSCAN clustering algorithm on top of Apache Spark. It is loosely based on the paper by He, Yaobin, et al., "MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data."
DBSCAN on Spark is available under the Apache 2.0 license. See the LICENSE file for details.
DBSCAN on Spark is maintained by Irving Cordova (irving@irvingc.com). This fork was slightly modified for benchmarking by Daniel Marcous (dmarcous@gmail.com).