mscoco

There are 56 repositories under the mscoco topic.

microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language:Python13.2k 127 3032k
sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Language:Python2.7k 24 187714
SwinTransformer/Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
Language:Python1.8k 22 217372
apple/ml-cvnets
CVNets: A library for training computer vision networks
Language:Python1.7k 33 92217
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Language:Jupyter Notebook1.4k 26 116378
HRNet/HRNet-Object-Detection
Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Language:Python641 16 5397
JDAI-CV/CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Language:Python507 10 3276
sacmehta/EdgeNets
This repository contains the source code of our work on designing efficient CNNs for computer vision
Language:Python409 20 3782
hyz-xmaster/VarifocalNet
VarifocalNet: An IoU-aware Dense Object Detector
Language:Python345 9 3552
hyz-xmaster/swa_object_detection
SWA Object Detection
Language:Python246 4 1826
ViTAE-Transformer/ViTAE-Transformer
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
Language:Python246 5 1828
MichiganCOG/ViP
Video Platform for Action Recognition and Object Detection in Pytorch
Language:Python220 15 3737
YehLi/ImageNetModel
Official ImageNet Model repository
Language:Jupyter Notebook194 5 1231
hustvl/BMaskR-CNN
[ECCV 2020] Boundary-preserving Mask R-CNN
Language:Python186 13 2641
peteanderson80/SPICE
Semantic Propositional Image Caption Evaluation
Language:Java130 5 1431
HRNet/HRNet-FCOS
High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm
Language:Python124 7 637
ntrang086/image_captioning
Generate captions for images using a CNN-RNN model trained on the Microsoft Common Objects in COntext (MS COCO) dataset.
Language:Python71 6 442
610265158/mobilenetv3_centernet
A TensorFlow implementation of MobileNetV3 CenterNet, which can be easily deployed on Android (MNN) and iOS (CoreML).
Language:Python70 9 1213
Weed-AI/Weed-AI
A repository to support the development of a repository and interchange format for weed identification annotation
Language:Python51 9 3846
peteanderson80/coco-caption
Adds SPICE metric to coco-caption evaluation server codes
Language:Jupyter Notebook50 7 642
lightly-ai/labelformat
A tool for converting computer vision label formats.
Language:Python45 5 63
oswaldoludwig/visually-informed-embedding-of-word-VIEW-
Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.
Language:Python30 3 011
utahnlp/consistency
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
Language:Python30 4 13
ayansengupta17/GAN
We aim to generate realistic images from text descriptions using GAN architecture. The network that we have designed is used for image generation for two datasets: MSCOCO and CUBS.
Language:HTML20 4 110
gautamchitnis/cocoapi
Clone of COCO API - Dataset @ http://cocodataset.org/ - with changes to support Windows build and python3
Language:Jupyter Notebook17 1 09
deepplants/ViT-PCM
Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"
Language:Python16 2 31
leftthomas/DeepMask
A Keras implementation of DeepMask based on NIPS 2015 paper "Learning to Segment Object Candidates"
Language:Python15 4 26
howardyclo/ImageNet2COCO
A demo for mapping class labels from ImageNet to COCO.
Language:Jupyter Notebook10 3 02
CLT29/semantic_neighborhoods
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]
Language:Python9 3 16
jakarto3d/jakarnotator
The Jakarnotator is an annotation tool for creating your own database for instance segmentation problems.
Language:JavaScript7 6 00
nayeem8527/Chitra-VarNan
Hindi Image Captioning
Language:Python7 3 21
canesee-project/Arabic-COCO
MS COCO captions in Arabic
6 2 01
VladimirSinitsin/labelme_converter
LabelMe to MsCOCO, PascalVOC, Yolo
Language:Python6 1 21
biyoml/PyTorch-SSD
PyTorch implementation of SSD: Single Shot MultiBox Detector.
Language:Python5 1 01
Lukeasargen/Show-Attend-and-Tell-Pytorch-Lightning
Encoder-Decoder CNN-LSTM Model with an attention mechanism for image captioning. Trained using the Microsoft COCO Dataset.
Language:Jupyter Notebook5 1 20
shunk031/huggingface-datasets_COCOA
COCOA: Semantic Amodal Segmentation for huggingface datasets
Language:Python4 2 0
