JerryChiao's Stars
xai-org/grok-1
Grok open release
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
black-forest-labs/flux
Official inference repo for FLUX.1 models
amusi/CVPR2024-Papers-with-Code
A collection of CVPR 2024 papers and open-source projects
sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
sebastianstarke/AI4Animation
Bringing Characters to Life with Computer Brains in Unity
Acly/krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
lllyasviel/Omost
Your image is almost there!
lllyasviel/IC-Light
More relighting!
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
QianMo/PBR-White-Paper
⚡️ White Paper on Physically Based Rendering (PBR)
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 Latest Papers, Code, and Datasets on Vid-LLMs.
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
PeterGriffinJin/Awesome-Language-Model-on-Graphs
A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey" (TKDE)
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
ingra14m/Awesome-Inverse-Rendering
A collection of papers on neural field-based inverse rendering.
jianghd1996/Camera-control
This project explores the opportunities of deep learning for camera control in virtual cinematography.