zengyh1900's Stars
astral-sh/ruff
An extremely fast Python linter and code formatter, written in Rust.
karpathy/llm.c
LLM training in simple, raw C/CUDA
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
VAST-AI-Research/TripoSR
ZHO-ZHO-ZHO/ComfyUI-Workflows-ZHO
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
cubiq/ComfyUI_IPAdapter_plus
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
shubham-goel/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
declare-lab/tango
A family of diffusion models for text-to-audio generation.
astral-sh/ruff-pre-commit
A pre-commit hook for Ruff.
IDKiro/sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
Tangshitao/MVDiffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
ali-vilab/Ranni
mira-space/MiraData
postech-ami/Paint-it
[CVPR'24] Official PyTorch Implementation of "Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering"
ashawkey/kiuikit
A toolkit for 3D computer vision tasks.
Jeff-LiangF/FlowVid
luosiallen/Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
zhizdev/mvdfusion
[CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation
RuoyuFeng/CCEdit
CCEdit: Creative and Controllable Video Editing via Diffusion Models
EnVision-Research/MotionInversion
junshutang/Make-It-Vivid
[CVPR 2024] Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text
aim-uofa/FreeCustom
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition