g-jing's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
LLaVA-VL/LLaVA-NeXT
google-deepmind/gemma
Open weights LLM from Google DeepMind.
genmoai/models
The best OSS video generation models
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
xiaobai1217/Awesome-Video-Datasets
Video datasets
autonomousvision/unimatch
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
KovenYu/WonderJourney
simular-ai/Agent-S
Agent S: an open agentic framework that uses computers like a human
OpenGVLab/VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
yzhang2016/video-generation-survey
A reading list of video generation
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
mira-space/Mira
tsujuifu/pytorch_mgie
A Gradio demo of MGIE
Ji4chenLi/t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
eric-ai-lab/swap-anything
Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"
songweige/content-debiased-fvd
[CVPR 2024] On the Content Bias in Fréchet Video Distance
jianzongwu/Language-Driven-Video-Inpainting
(CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"
eric-ai-lab/via-video
WildVision-AI/LMM-Engines
WildVision-AI/WildVision-Arena
https://huggingface.co/spaces/WildVision/vision-arena