BDAProject2016

This is an example implementation of Bi-Clustering (also known as CoClustering)

The implementation is written in scala and uses Spark to work.

To begin, firt run loadData.py This will load sample data into Hive as a table called "dataset"

dev-d/Spark-CoClustering