zbxzc35/ConvNets

research and implementations of CNNs and their applications

MIT

CNN Architectures

[LeNet(1998)] [paper]
[LeNet-5 (2010)] [paper]
[AlexNet (2012)] [paper]
[ZFNet(2013)] [paper]
[VGGNet (2014)] [paper]
[GoogleNet/Inception(2014)] [paper]
[FCN(2014)] [paper]
[RCNN(2014)] [paper]
[Deeply-supervised networks(2014)] [paper]
[ResNet(2015)] [paper]
[Ladder network(2015)] [paper]
[YOLO(2015)] [Paper]
[FractalNet (2016)] [paper]
[PolyNet/Inception-Residual(2016)] [paper]
[DenseNet(2016)] [paper] [code]
[SegNet(2016)] [paper]
[fast region based CNN(2016)] [paper]
[Look up based CNN(2016)] [paper]
[Deep network with stochastic depth(2016)] [paper]
[ResNeXt(2016)] [paper]
[SqueezeNet(2016)] [paper] [code]
[CapsNet(2017)] [paper]
[MobileNets(2017)] [paper]
[Xception(2017)] [paper]
[IRCNN(2017)][paper]
[ViP CNN(2017)] [paper]
[Squeeze-and-Excitation Networks(2017)][Paper] [code]
[MobileFaceNets(2018)] [paper]
[DCNet and DCNet++(2018)] [paper]

Applications

Object Recognition / Object Classification [SOTA]
Object Detection [SOTA]
Semantic Segmentation [SOTA]
Object Tracking [SOTA]
[Activity/Action Recognition] [SOTA]
[Face Recognition]
[Pose estimation]
[Video & Image Captioning]
[Biomedical Imaging] SOTA
[Remote Sensing]
[Video Analysis]
[3D Vision]
[CNNs for NLP]
[CNNs for Speech Processing]
[Adversarial Attacks on CNN] SOTA

Object Detection

[Light-Head R-CNN(2017)] [paper]
[Cascade R-CNN(2017)] [paper]
[YOLT(2018)] [paper]
[FSSD(2018)] [paper]
[ESSD] [paper]
[MDSSD(2018)] [paper]
[Pelee(2018)] [paper]
[Fire SSD(2018)] [paper]
[MegNet(2018)] [paper]
[DetNet(2018)] [paper]
[SSOD(2018)] [paper]
[CornerNet(2018)] [paper]
[3D Object Detection(2018)] [paper]
[ZSD（Zero-Shot Object Detection）(2018)] [paper]
[OSD（One-Shot object Detection）(2018)] [paper]
[Weakly Supervised Object Detection(2018)] [paper]
[Softer-NMS (2018)] [paper]
[VideoCapsuleNet(2018)] [paper]
[YOLO3D(2018)] [paper]

Semantic Segmentation

U-Net [arxiv][Pytorch]
SegNet [arxiv][Caffe]
DeepLab [arxiv][Caffe]
FCN [arxiv][tensorflow]
ENet [arxiv][Caffe]
LinkNet [arxiv][Torch]
DenseNet [arxiv[]
Tiramisu [arxiv]
DilatedNet [arxiv]
PixelNet [arxiv][Caffe]
ICNet [arxiv][Caffe]
ERFNet [arxiv][Torch]
RefineNet [arxiv][tensorflow]
PSPNet [arxiv,pspnet][Caffe]
Dilated convolution [arxiv][Caffe]
DeconvNet [arxiv][Caffe]
FRRN [arxiv][Lasagne]
GCN [arxiv][PyTorch]
LRR [arxiv][Matconvnet]
DUC, HDC [arxiv][PyTorch]
MultiNet [arxiv] [tensorflow1 tensorflow2]
Segaware [arxiv][Caffe]
Semantic Segmentation using Adversarial Networks [arxiv] [Chainer]
In-Place Activated BatchNorm:obtain #1 positions [arxiv] [Pytorch]
[SegCaps(2018)] [arxiv]
[SegFinNet(2018)] [arxiv]
[SUNets(2018)] [arxiv]

References

Maintainer

Gopala KR / @gopala-kr