Pter61
Ph.D. student in Institute of Information Engineering, Chinese Academy of Sciences, interest in Vision-Language Alignment.
UCASBeijing
Pinned Repositories
AlignCMSS
ban-vqa
Bilinear attention networks for visual question answering
context-i2w
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
CSSummerCamp2020
关于2020年CS保研夏令营的汇总。欢迎大家分享夏令营信息,资瓷一下互联网精神吼不吼啊?
denoise-i2w-tmm
dynamic_fusion_reimplementation
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
hello-world-actions
image-captioning-bottom-up-top-down
PyTorch implementation of Image captioning with Bottom-up, Top-down Attention
VL-BERT
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
vlpmarker
Pter61's Repositories
Pter61/context-i2w
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
Pter61/vlpmarker
Pter61/AlignCMSS
Pter61/denoise-i2w-tmm
Pter61/VL-BERT
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
Pter61/ban-vqa
Bilinear attention networks for visual question answering
Pter61/CSSummerCamp2020
关于2020年CS保研夏令营的汇总。欢迎大家分享夏令营信息,资瓷一下互联网精神吼不吼啊?
Pter61/dynamic_fusion_reimplementation
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
Pter61/hello-world-actions
Pter61/image-captioning-bottom-up-top-down
PyTorch implementation of Image captioning with Bottom-up, Top-down Attention
Pter61/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
Pter61/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Pter61/Up-Down-Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Pter61/Pter61