Implementation of camera model identification system by team "[ods.ai] GPU_muscles" (2nd place overall in Kaggle competition IEEE's Signal Processing Society - Camera Model Identification and 1st place among student eligible teams).
Should any questions arise regarding the solution, please do not hesitate to contact me
on Telegram or via e-mail ikibardin@gmail.com
- Artur Kuzin [linkedin]
- Valeriy Babushkin [linkedin]
- Artur Fattakhov [kaggle]
- Ilya Kibardin [linkedin]
- Andrey Kiselev [linkedin]
To train models and get predictions the following is required:
- OS: Ubuntu 16.04
- Python 3.6
- Hardware:
- Any decent modern computer with x86-64 CPU,
- 32 GB RAM
- 4 x Nvidia GeForce GTX 1080 Ti
- Install required OS and Python
- Install packages with
pip install -r requirements.txt
- Create
data
folder at the root of the repository. Place train dataset from Kaggle competition todata/train
. Place test dataset from Kaggle competition todata/test
. Place additional validation images todata/val_images
. - Place
se_resnet50.pth
andse_resnext50.pth
toimagenet_pretrain
folder. - Place the following final weights to
final_weights
folder:densenet161_28_0.08377413648371115.pth
densenet161_55_0.08159203971706519.pth
densenet161_45_0.0813179751742137.pth
dpn92_tune_11_0.1398952918197271.pth
dpn92_tune_23_0.12260739478774665.pth
dpn92_tune_29_0.14363511492280367.pth
Run bash final_submit.sh -d <folder with test images> -o <output .csv filename>
This section describes the steps required to train our ensemble.
Images from both Yandex.Fotki and Flickr are essential for reproducing our solution.
Run bash download_from_yandex.sh
Unfortunately, this step involves some manual actions.
- cd into
downloader/flickr
- For every model go to the telephone model group page from
flickr_groups.txt
. Scroll every gallery page to the end and download as html file to the corresponding folder. As a result you will have a set of folders with .html files corresponding to a specific phone model athtml_pages
folder. - Run
python pages_to_image_links.py
. The result of the script will be folderlinks
of .csv files with links to photos of each phone model. - Run
python download_from_links.py
to download images from the links received in the previous paragraph (previous two steps could be skipped, because thelinks
folder already contains necessary files).
Run bash filter.sh
- Download and filter external dataset as described above.
- Run
bash init_train.sh
to train 9 models. - Run
bash make_pseudo.sh
to get predictions from these models for images atdata/test
and create pseudo labels. - Run
bash final_train.sh
to train the same 9 models but using pseudo labels this time. - Run
bash predict.sh -d <folder with test images> -o <output .csv filename>
to get predictions from the ensemble.