This project summarize awesome papers related to efficient multimodal
- Minivlm: A smaller and faster vision-language model, Arxiv, Jianfeng Wang, Xiaowei Hu, Pengchuan Zhang, Xiujun Li, Lijuan Wang, Lei Zhang, Jianfeng Gao, Zicheng Liu
- Compressing Visual-linguistic Model via Knowledge Distillation, ICCV 2021, Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lijuan Wang, Yezhou Yang, Zicheng Liu
- Playing Lottery Tickets with Vision and Language, AAAI 2022, Zhe Gan, Yen-Chun Chen, Linjie Li, Tianlong Chen, Yu Cheng, Shuohang Wang, Jingjing Liu, Lijuan Wang, Zicheng Liu
- Multimodal Few-Shot Learning with Frozen Language Models NeurIPS2021, Maria Tsimpoukelli, Jacob Menick, Serkan Cabi, S. M. Ali Eslami, Oriol Vinyals, Felix Hill