Pinned Repositories
activitynet-qa
An VideoQA dataset based on the videos from ActivityNet
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
bottom-up-features
Bottom-up features extractor implemented in PyTorch.
caffe
Caffe: a fast open framework for deep learning.
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
FPN
Feature Pyramid Networks for Object Detection
mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
openvqa
A lightweight, scalable, and general framework for visual question ansering (VQA) research
pytorch-pretrained-BERT
A PyTorch implementation of Google AI's BERT model with script to load Google's pre-trained models
vqa-mfb
yuzcccc's Repositories
yuzcccc/vqa-mfb
yuzcccc/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
yuzcccc/pytorch-pretrained-BERT
A PyTorch implementation of Google AI's BERT model with script to load Google's pre-trained models
yuzcccc/openvqa
A lightweight, scalable, and general framework for visual question ansering (VQA) research
yuzcccc/bottom-up-features
Bottom-up features extractor implemented in PyTorch.
yuzcccc/finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
yuzcccc/FPN
Feature Pyramid Networks for Object Detection
yuzcccc/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
yuzcccc/activitynet-qa
An VideoQA dataset based on the videos from ActivityNet
yuzcccc/caffe
Caffe: a fast open framework for deep learning.
yuzcccc/DDPN
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
yuzcccc/Deformable-ConvNets-caffe
Deformable Convolutional Networks on caffe
yuzcccc/Detectron
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
yuzcccc/Detectron.pytorch
A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.
yuzcccc/py-R-FCN-multiGPU
Code for training py-faster-rcnn and py-R-FCN on multiple GPUs in caffe
yuzcccc/pythia
A modular framework for Visual Question Answering research by the FAIR A-STAR team
yuzcccc/R-FCN-PSROIAlign
A Caffe implementation of PSROI-Align
yuzcccc/seg_every_thing
Code release for R. Hu et al., Learning to Segment Every Thing. in CVPR, 2018.