Pinned Repositories
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
coco-caption
faster_rcnn
Faster R-CNN
grad-cam
[ICCV 2017] Torch code for Grad-CAM
leetcode
LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
leetcode-1
Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.
MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
models
Models built with TensorFlow
multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
xianglinbuaa's Repositories
xianglinbuaa/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
xianglinbuaa/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
xianglinbuaa/coco-caption
xianglinbuaa/faster_rcnn
Faster R-CNN
xianglinbuaa/grad-cam
[ICCV 2017] Torch code for Grad-CAM
xianglinbuaa/leetcode
LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
xianglinbuaa/leetcode-1
Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.
xianglinbuaa/MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
xianglinbuaa/models
Models built with TensorFlow
xianglinbuaa/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
xianglinbuaa/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
xianglinbuaa/object_relation_transformer
Implementation of the Object Relation Transformer for Image Captioning
xianglinbuaa/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
xianglinbuaa/Oscar
Oscar and VinVL
xianglinbuaa/py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
xianglinbuaa/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
xianglinbuaa/pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
xianglinbuaa/pytorch-faster-rcnn
pytorch1.0 updated. Support cpu test and demo. (Use detectron2, it's a masterpiece)
xianglinbuaa/scene-graph-TF-release
"Scene Graph Generation by Iterative Message Passing" code repository
xianglinbuaa/Semi-Supervised-Image-Captioning
Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"
xianglinbuaa/SGAE
xianglinbuaa/unsupervised_captioning
Code for Unsupervised Image Captioning
xianglinbuaa/Up-Down-Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention.