terry-r123's Stars
terry-r123/Awesome-Captioning
A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
sokrypton/ColabFold
Making Protein folding accessible to all!
deepmodeling/Uni-Fold
FangShancheng/ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
Hangz-nju-cuhk/Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
facebookresearch/grid-feats-vqa
Grid features pre-training code for visual question answering
albanie/collaborative-experts
Video embeddings for retrieval with natural language queries
microsoft/Oscar
Oscar and VinVL
DTaoo/Discriminative-Sounding-Objects-Localization
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
karpathy/neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
karpathy/neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
LividWo/Revisit-MMT
fortunechen/Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
ruotianluo/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
twairball/fairseq-zh-en
NMT for chinese-english using fairseq
NeuronDance/DeepRL
Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone