aldoz-mila

MilaMontreal

Pinned Repositories

t2v_metrics_cogvlm
Evaluating text-to-image/video/3D models with VQAScore
Language:Python00
4D-Facial-Avatars
Dynamic Neural Radiance Fields for Monocular 4D Facial Avater Reconstruction
Language:Python681 29 6167
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.3k 158 1.5k2.2k
t2v_metrics
Evaluating text-to-image/video/3D models with VQAScore
Language:Python227 15 1120
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.1k 49 450385
One2345plus
530 54 106
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
Language:Python6.1k 66 425417

aldoz-mila/t2v_metrics_cogvlm
Evaluating text-to-image/video/3D models with VQAScore
Language:Python00