jianlong-yuan

Interested in Dense Prediction, such as Depth Estimation and Semantic Segmentation

Alibaba-DAMObeijing

jianlong-yuan's Stars

chatanywhere/GPT_API_free
Free ChatGPT API Key，免费ChatGPT API，支持GPT4 API（免费），ChatGPT国内可用免费转发API，直连无需代理。可以搭配ChatBox等软件/插件使用，极大降低接口使用成本。国内即可无限制畅快聊天。
Language:Python26.4k 122 3182k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python10.1k 126 488946
clappr/clappr
:clapper: An extensible media player for the web.
Language:JavaScript7.2k 235 1.5k855
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.7k 45 164270
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python1.9k 32 5477
siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
Language:Jupyter Notebook1.8k 39 466110
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Language:Python1.5k 17 132117
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language:Python1.5k 43 58153
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
1.4k 59 1966
menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
1.4k 124 2655
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Language:Python1.2k 19 65151
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python1.1k 15 4746
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Language:Python1.1k 29 5339
finegrain-ai/refiners
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
Language:Python778 14 1756
Vchitect/Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Language:Python662 7 1318
magic-research/PLLaVA
Official repository for the paper PLLaVA
Language:Python623 15 7945
aigc-apps/CogVideoX-Fun
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
Language:Python588 8 9040
hehao13/CameraCtrl
Language:Python461 12 1720
csuldw/AntSpider
1000万豆瓣电影/评论/名人/评分数据采集源码分享（内含千万电影数据集，可下载）
Language:Python443 8 872
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
Language:Python407 16 267
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML398 15 522
aim-uofa/MovieDreamer
261 24 38
baaivision/EVE
[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
Language:Python254 9 165
AILab-CVC/CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Language:Jupyter Notebook253 14 169
mbzuai-oryx/VideoGPT-plus
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Language:Python236 5 2615
ai-forever/MoVQGAN
MoVQGAN - model for the image encoding and reconstruction
Language:Jupyter Notebook211 4 814
WHB139426/Grounded-Video-LLM
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Language:Python75 4 54
instantX-research/InstantUnify
InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥
39 13 11
FuchenUSTC/VideoStudio
Language:Python16 1 21
robincourant/the-exceptional-trajectories
Language:Python9 1 30

jianlong-yuan

jianlong-yuan's Stars

chatanywhere/GPT_API_free

THUDM/CogVideo

clappr/clappr

facebookresearch/sapiens

baaivision/Emu3

siliconflow/onediff

ZhengPeng7/BiRefNet

Picsart-AI-Research/StreamingT2V

wangkai930418/awesome-diffusion-categorized

menyifang/MIMO

mini-sora/minisora

showlab/Show-o

Drexubery/ViewCrafter

finegrain-ai/refiners

Vchitect/Vchitect-2.0

magic-research/PLLaVA

aigc-apps/CogVideoX-Fun

hehao13/CameraCtrl

csuldw/AntSpider

RunpeiDong/DreamLLM

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

aim-uofa/MovieDreamer

baaivision/EVE

AILab-CVC/CV-VAE

mbzuai-oryx/VideoGPT-plus

ai-forever/MoVQGAN

WHB139426/Grounded-Video-LLM

instantX-research/InstantUnify

FuchenUSTC/VideoStudio

robincourant/the-exceptional-trajectories