Deep Cluster

Deep Cluster is a clustering technique that combines an autoencoder with the K-Means algorithm.

K-Means clustering is performed on a low-dimensional latent vector extracted by the autoencoder (AE).

This gives better clustering performance than running K-Means directly on the raw data.
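
The two-stage idea can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the implementation used in this repository: the toy blob data, the single-layer encoder and decoder, the latent size, the learning rate, and the helper names `train_dense_ae` and `kmeans` are all assumptions made for the example.

```python
import numpy as np

def train_dense_ae(x, latent_dim=4, epochs=500, lr=0.01, seed=0):
    """Train a tiny dense autoencoder with full-batch gradient descent.

    Returns an encoder function and the reconstruction loss per epoch.
    """
    rng = np.random.default_rng(seed)
    n, d = x.shape
    w1 = rng.normal(0.0, 0.1, (d, latent_dim))
    b1 = np.zeros(latent_dim)
    w2 = rng.normal(0.0, 0.1, (latent_dim, d))
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        pre = x @ w1 + b1
        z = np.maximum(pre, 0.0)            # ReLU latent vector
        err = z @ w2 + b2 - x               # reconstruction error
        losses.append(float((err ** 2).sum(axis=1).mean()))
        g_out = 2.0 * err / n               # gradient of the loss w.r.t. the output
        g_pre = (g_out @ w2.T) * (pre > 0)  # backpropagate through the ReLU
        w2 -= lr * (z.T @ g_out)
        b2 -= lr * g_out.sum(axis=0)
        w1 -= lr * (x.T @ g_pre)
        b1 -= lr * g_pre.sum(axis=0)

    def encode(data):
        return np.maximum(data @ w1 + b1, 0.0)

    return encode, losses

def kmeans(z, k, iters=50, seed=0):
    """Plain Lloyd's K-Means; an empty cluster keeps its previous centroid."""
    rng = np.random.default_rng(seed)
    centers = z[rng.choice(len(z), size=k, replace=False)]
    for _ in range(iters):
        dists = ((z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = z[labels == j]
            if len(members):
                centers[j] = members.mean(axis=0)
    # Final assignment against the updated centroids.
    dists = ((z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1), centers

# Toy stand-in for image data: 3 Gaussian blobs in 8 dimensions, standardized.
data_rng = np.random.default_rng(42)
blob_centers = data_rng.normal(0.0, 3.0, (3, 8))
x = np.concatenate([c + data_rng.normal(0.0, 0.5, (60, 8)) for c in blob_centers])
x = (x - x.mean(axis=0)) / x.std(axis=0)

encode, losses = train_dense_ae(x)    # stage 1: learn the latent space
latent = encode(x)                    # stage 2a: extract latent vectors
labels, _ = kmeans(latent, k=3)       # stage 2b: cluster in the latent space
print(latent.shape, labels.shape)
```

A real setup would most likely use a deeper AE from a deep learning framework and a library K-Means; the sketch only shows the order of the steps: train the AE for reconstruction, encode the data, then cluster the latent vectors.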

Below are the results of benchmarking on the unlabeled MNIST and Fashion-MNIST datasets.

Things we tried that didn't work

Convolutional Autoencoder
- A convolutional autoencoder (CAE) was used in an attempt to extract a better latent vector, but the result was not as good as with the Dense AE.
Convolutional Encoder with Dense Decoder
- Convolution and pooling were repeated to expand the filters' receptive field, and the latent vector was then extracted through Global Average Pooling. A Dense Decoder was used to restore the extracted latent vector to the original image. But the result was not as good as with the Dense AE.
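
As a rough illustration of the Global Average Pooling step, the sketch below collapses each channel of a feature map to a single number, so the channel count becomes the latent vector size. The channels-last layout `(N, H, W, C)` and the tensor sizes are assumptions for the example, not values taken from this project.

```python
import numpy as np

def global_average_pool(feature_maps):
    """Average each channel over its spatial axes: (N, H, W, C) -> (N, C)."""
    return feature_maps.mean(axis=(1, 2))

# Hypothetical output of the last convolution block:
# 2 images, 7x7 spatial maps, 16 channels (channels-last layout assumed).
feature_maps = np.arange(2 * 7 * 7 * 16, dtype=np.float64).reshape(2, 7, 7, 16)
latent = global_average_pool(feature_maps)
print(latent.shape)  # (2, 16)
```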
Dense Encoder with Convolutional Decoder
- Likewise, the result was worse than Dense AE.
Leaky or Parametric ReLU activation
- First, we used an activation function with a nonzero slope for negative inputs throughout the model.
- Second, we used such an activation function only at the layer that outputs the latent vector.
- Both attempts gave worse results than using the vanilla ReLU activation function on all layers.
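
For reference, the difference between the activations compared above can be sketched as follows. The slope value `alpha` is an assumption chosen for illustration; in Parametric ReLU the slope is a learned parameter rather than a fixed constant.

```python
import numpy as np

def relu(x):
    """Vanilla ReLU: negative inputs are clipped to zero."""
    return np.maximum(x, 0.0)

def leaky_relu(x, alpha=0.1):
    """Leaky ReLU: negative inputs are scaled by a small slope alpha.

    Parametric ReLU has the same form, but alpha is learned during training.
    """
    return np.where(x > 0, x, alpha * x)

x = np.array([-2.0, -0.5, 0.0, 1.5])
relu_out = relu(x)          # negative inputs become 0
leaky_out = leaky_relu(x)   # negative inputs are scaled by alpha
print(relu_out, leaky_out)
```

With vanilla ReLU at the latent layer, every latent component is non-negative, whereas the leaky variants let negative values pass through at a reduced scale.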