/siamesenetwork-tensorflow

Using siamese network to do dimensionality reduction and similar image retrieval

Primary LanguageJupyter NotebookMIT LicenseMIT

#Siamese Network Tensorflow Siamese network is a neural network that contain two or more identical subnetwork. The purpose of this network is to find the similarity or comparing the relationship between two comparable things. Unlike classification task that uses cross entropy as the loss function, siamese network usually uses contrastive loss or triplet loss.

This project follows Hadsell-et-al.'06 [1] by computing the Euclidean distance on the output of the shared network and by optimizing the contrastive loss (see paper for more details).

[1] "Dimensionality Reduction by Learning an Invariant Mapping" http://yann.lecun.com/exdb/publis/pdf/hadsell-chopra-lecun-06.pdf

Model

The input of these will be image_left, image_right and . Our model uses 5 layer of convolutional layer and pooling followed. We do not use fully convolutonal net because convolution operation is faster on GPU(especially using CUDNN). See http://cs231n.github.io/convolutional-networks/#convert for more information on converting FC layer to Conv layer.

Run

Train the model

git clone https://github.com/ardiya/siamesenetwork-tensorflow
python train.py

Tensorboard Visualization(After training)

tensorboard --logdir=train.log

Dimensionality reduction

Result on MNIST Dataset: See folder img to see the process until it converge, it is really fun to watch :)

Image retrieval

Image retrieval uses the trained model to extract the features and get the most similar image using cosine similarity. See here

Random image from test data

e.g id 865 in test image

Retrieved top n similar image from train data

e.g. return id [53144 47864 11074 51561 41350 34215 48182] from train data