/mask_rcnn_pytorch

Mask RCNN in PyTorch

Primary LanguagePythonMIT LicenseMIT

Mask RCNN in PyTorch

  1. This is a PyTorch implementation of Mask RCNN which attempts to reproduce the results in Mask RCNN.
  2. This project provides an implementation of ROI Align in CUDA C as well as a PyTorch nn.Module for it.
  3. The model architecture is based on the awesome Faster RCNN with PyTorch repo.
  4. The experiment setup is based on Image Classification Project Killer in Pytorch to minimize the effort of doing experiments and developing new models.

Spectial thanks to Fast Mask RCNN for being the catalyst of this project.

Progress (with expected time)

  • ROIAlign layer (It is not explained in the paper how four regular locations are chosen, so currently, this implementation uses 4 corners of each bin. This may be changed either after hyper-parameter tuning stage or after the release of the original code.)
  • COCO dataloader with mask
  • FastRCNN with ROIAlign using ResNet-50-C4
  • Training code
  • Test the FastRCNN with ROI Algin (In progress)
  • MaskRCNN using ResNet-50-C4 (4/12)
  • FPN backbone (4/14)
  • Testing all code
  • Turing hyper-parameters
  • Considering removing dependency of numpy and cython
  • Speed up
  • Clean the code and comment
  • Pretrained models
  • Demo code