/bico

BICO: BIRCH Meets Coresets for k-Means Clustering

Primary LanguageC++GNU General Public License v3.0GPL-3.0

BICO

Code from the BICO website.

Getting Started

Use the pre-made Makefile in the bico/build directory to build the project:

make -C bico/build

Downloading Datasets

  • US Census Data (1990)

    mkdir -p data/raw
    mkdir -p data/results
    curl https://archive.ics.uci.edu/ml/machine-learning-databases/census1990-mld/USCensus1990.data.txt \
        --output data/raw/USCensus1990.data.txt
  • Covertype

    curl https://archive.ics.uci.edu/ml/machine-learning-databases/covtype/covtype.data.gz \
        --output data/raw/covtype.data.gz
  • Bag of Words Datasets

    curl https://archive.ics.uci.edu/ml/machine-learning-databases/bag-of-words/docword.enron.txt.gz \
        --output data/raw/docword.enron.txt.gz
  • Tower dataset

    curl http://homepages.uni-paderborn.de/frahling/instances/Tower.txt \
        --output data/raw/Tower.txt