DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Python · BSD-3-Clause
Stargazers
- fly51fly (PRIS)
- Gaozhongpai (United Imaging Intelligence)
- GeneZC
- gmftby (GMFTBYBeijing Institute of Technology)
- guanzhchen (Sun Yat-sen University)
- hefeng1995
- HenryHZY (LaVi Lab led by Prof. Liwei Wang @ CSE, CUHK)
- hysts
- jetwu-create
- Jinjikiko (Wuhan University)
- jinzhuoran (NEU & CASIA)
- kenchan0226 (Singapore)
- kugwzk
- Kun1Qi
- lgcming
- liuxuebo0 (SenseTime)
- lixin4ever (@alibaba)
- lwaekfjlk (CKC@ZJU -> LTI@CMU -> CS@UIUC)
- marktube
- MingfengXue
- pixeli99 (DUT IIAU)
- quartets
- ringos (Hong Kong)
- slyviacassell
- spc121 (DAMO Academy|XJTU)
- Sun-Yi-Heng
- sunyuhan19981208 (Sensetime & Zhejiang University)
- tensorboy (TikTok Inc)
- UCLWilson (Spacetime Lab, UCL)
- xuhzyy
- xujames (New York)
- yangkexin (Sichuan university)
- yenanfei (Huawei)
- Zhang-Each (Zhejiang University)
- zwhe99 (Shanghai Jiao Tong University)
- zwhus