TencentAILab-CVC

Tencent AI Lab - Computer Vision Center

Pinned Repositories

CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Language: Jupyter Notebook 268 14 189
FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
Language: Python 399 6 1826
GPT4Tools
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.
Language: Python 768 13 1758
SEED
Official implementation of SEED-LLaMA (ICLR 2024).
Language: Python 602 16 5133
SEED-Bench
(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
Language: Python 332 4 2913
SEED-X
Multimodal Models in Real World
Language: Jupyter Notebook 449 18 3220
TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters
261 24 712
UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Language: Python 974 13 2058
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language: Python 4.8k 71 85364
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language: Python 5.2k 47 521499

TencentAILab-CVC's Repositories

AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language: Python 5.2k 47 521499
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language: Python 4.8k 71 85364
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Language: Python 974 13 2058
AILab-CVC/GPT4Tools
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.
Language: Python 768 13 1758
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
Language: Python 602 16 5133
AILab-CVC/SEED-X
Multimodal Models in Real World
Language: Jupyter Notebook 449 18 3220
AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
Language: Python 399 6 1826
AILab-CVC/SEED-Bench
(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
Language: Python 332 4 2913
AILab-CVC/CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Language: Jupyter Notebook 268 14 189
AILab-CVC/TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters
261 24 712
AILab-CVC/Animate-A-Story
Retrieval-Augmented Video Generation for Telling a Story
256 23 319
AILab-CVC/VideoGen-Eval
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
212 13 89
AILab-CVC/Make-Your-Video
[IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance
Language: Python 191 15 48
AILab-CVC/GroupMixFormer
GroupMixAttention and GroupMixFormer
Language: Python 115 9 412
AILab-CVC/M2PT
[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Language: Python 99 7 25
AILab-CVC/VL-GPT
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
85 19 22
AILab-CVC/HiFi-123
[ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Language: Python 67 12 11
AILab-CVC/AILab-CVC.github.io
Homepage of Tencent AI Lab CVC.
Language: HTML 0 7 00
