phellonchen
Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences
Beijing, China
phellonchen's Stars
yuyq96/TextHawk
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
phellonchen/X-LLM
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
lemuria-wchen/DialogVED
Code and released pre-trained model for our ACL 2022 paper: "DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation"
MingLunHan/CIF-PyTorch
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
MingLunHan/CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
phellonchen/awesome-visual-dialog
Recent Advances in Visual Dialog
ZhenYangIACAS/WeTS
A benchmark for the task of translation suggestion
phellonchen/awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
dqqcasia/st
End-to-end Speech Translation
dqqcasia/awesome-speech-translation
davidnvq/visdial
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
phellonchen/DMRM
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
yuleiniu/rva
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
ZiJianZhao/SeqGAN-PyTorch
A implementation of SeqGAN in PyTorch, following the implementation in tensorflow.