This is an example implementation of Bi-Clustering (also known as CoClustering)
The implementation is written in scala and uses Spark to work.
For an introduction to co-clustering, see http://scikit-learn.org/stable/modules/biclustering.html
For an introduction to Scala for Python developers, see https://bugra.github.io/work/notes/2014-10-18/scala-basics-for-python-developers/
To begin, firt run loadData.py
This will load sample data into Hive as a table called "dataset"