/torchcv

TorchCV: a PyTorch vision library mimics ChainerCV

Primary LanguagePythonMIT LicenseMIT

TorchCV: a PyTorch vision library mimics ChainerCV

Detection

Model Original Paper ChainerCV TorchCV*
SSD300@voc07_test 74.3% 77.8% 76.68%
SSD512@voc07_test 76.8% 79.2% 78.89%
FPNSSD512@voc07_test - - 81.46%

The accuracy of TorchCV SSD is ~1% lower than ChainerCV. This is because the VGG base model I use performs slightly worse.
I did the experiment by replacing pytorch/vision VGG16 model with the model used in ChainerCV, the SSD512 model got 79.85% accuracy.

FPNSSD512 is created by replacing SSD VGG16 network with FPN50, the rest is the same. It beats all SSD models.
You can download the trained params here.

Update

Our FPNSSD512 model achieved the 1st place on the PASCAL VOC 2012 dataset (by 2018-2-6).

image Check the learderboard.

TODO

  • SSD300
  • SSD512
  • FPNSSD512
  • YOLOV2
  • RetinaNet