TorchCV: a PyTorch vision library mimics ChainerCV

Detection

Model	Original Paper	ChainerCV	TorchCV*
SSD300@voc07_test	74.3%	77.8%	76.68%
SSD512@voc07_test	76.8%	79.2%	78.89%
FPNSSD512@voc07_test	-	-	81.46%

The accuracy of TorchCV SSD is ~1% lower than ChainerCV. This is because the VGG base model I use performs slightly worse.
I did the experiment by replacing pytorch/vision VGG16 model with the model used in ChainerCV, the SSD512 model got 79.85% accuracy.

FPNSSD512 is created by replacing SSD VGG16 network with FPN50, the rest is the same. It beats all SSD models.
You can download the trained params here.

Update

Our FPNSSD512 model achieved the 1st place on the PASCAL VOC 2012 dataset (by 2018-2-6).

Check the learderboard.

aosokin/torchcv

TorchCV: a PyTorch vision library mimics ChainerCV

Detection

Update

TODO