Visual Product Recognition Challenge

My solution for the AIcrowd Visual Product Recognition Challenge. The goal is to perform visual search over e-commerce products.

The training procedure and most of the hyperparameters are taken from the paper by the authors of the repo I forked.

Structure

main branch contains all of my experiments. If you are interested in some of my ideas, you can check it out; however, the experiments are not documented, so you will need to play around.
aicrowd branch contains my final solution.

Install CUDA.

Note: ViT-H is a huge model; you need at least 24 GB of VRAM to run the experiments.
pip install -r requirements.txt

Download the datasets and unzip them into their respective folders.

To download the Amazon dataset, run:

cd amazon_dataset_1
python download_meta_data.py
python download_images.py

Note: You might need to run the scripts multiple times.
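If re-running the download scripts by hand gets tedious, a small wrapper can repeat them automatically. The sketch below is not part of the repo. `run_with_retries` is a hypothetical helper, and it assumes that the scripts exit with a non-zero status when a run fails and that they are safe to run again. If a script exits with status 0 even when some files are missing, this wrapper will not notice.

```python
import subprocess
import sys
import time


def run_with_retries(cmd, max_attempts=5, delay_seconds=10.0, cwd=None):
    """Run `cmd` until it exits with status 0 or the attempts run out.

    Hypothetical helper (not from the repo). Returns the number of
    attempts used and raises RuntimeError if every attempt fails.
    """
    for attempt in range(1, max_attempts + 1):
        result = subprocess.run(cmd, cwd=cwd)
        if result.returncode == 0:
            return attempt
        print(
            f"attempt {attempt}/{max_attempts} failed "
            f"with exit code {result.returncode}",
            file=sys.stderr,
        )
        # Wait before the next try, but not after the last one.
        if attempt < max_attempts:
            time.sleep(delay_seconds)
    raise RuntimeError(f"{cmd!r} failed after {max_attempts} attempts")
```

For example, `run_with_retries([sys.executable, "download_images.py"], cwd="amazon_dataset_1")` would re-run the image download up to five times, pausing ten seconds between attempts.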

To run the weight ensemble see link.
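For readers unfamiliar with the idea, here is a minimal sketch of what a weight ensemble commonly means: averaging the parameters of several checkpoints that share the same architecture. This is an assumption about the general technique, not the procedure from the linked instructions. `average_state_dicts` is a hypothetical helper, and the checkpoints are represented as plain dictionaries mapping parameter names to values.

```python
def average_state_dicts(state_dicts, weights=None):
    """Average parameters key by key across checkpoints.

    Hypothetical helper (not from the repo). Each entry of `state_dicts`
    maps parameter names to values that support `+` and scalar `*`
    (floats here; arrays or tensors behave the same way). `weights` are
    optional per-checkpoint coefficients; they are normalised to sum to 1.
    """
    if not state_dicts:
        raise ValueError("need at least one checkpoint")
    if weights is None:
        weights = [1.0] * len(state_dicts)
    if len(weights) != len(state_dicts):
        raise ValueError("need exactly one weight per checkpoint")

    # All checkpoints must come from the same architecture.
    keys = state_dicts[0].keys()
    for sd in state_dicts[1:]:
        if sd.keys() != keys:
            raise ValueError("checkpoints have different parameter names")

    total = sum(weights)
    return {
        key: sum((w / total) * sd[key] for w, sd in zip(weights, state_dicts))
        for key in keys
    }


# Toy example with scalar "parameters".
ckpt_a = {"w": 1.0, "b": 0.0}
ckpt_b = {"w": 3.0, "b": 2.0}
uniform = average_state_dicts([ckpt_a, ckpt_b])
weighted = average_state_dicts([ckpt_a, ckpt_b], weights=[1.0, 3.0])
```

With real checkpoints, integer buffers (step counters, for instance) would be turned into floats by this averaging, so they would need separate handling.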

The trained models can be found on Hugging Face.