zyayoung's Stars
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
NVIDIA/pix2pixHD
Synthesizing and manipulating 2048x1024 images with conditional GANs
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
hzwer/ECCV2022-RIFE
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
facebookresearch/theseus
A library for differentiable nonlinear optimization
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
CPJKU/madmom
Python audio and music signal processing library
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
chengxuxin/extreme-parkour
[ICRA 2024]: Train your parkour robot in less than 20 hours.
mir-aidj/all-in-one
All-In-One Music Structure Analyzer
aigc-apps/CogVideoX-Fun
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
zsyzzsoft/co-mod-gan
[ICLR 2021, Spotlight] Large Scale Image Completion via Co-Modulated Generative Adversarial Networks
AIFSH/ComfyUI-MimicMotion
A ComfyUI custom node for MimicMotion
kijai/ComfyUI-MimicMotionWrapper
YihangChen-ee/HAC
🏠 [ECCV 2024] PyTorch implementation of 'HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression'
YoungSeng/DiffuseStyleGesture
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)
SJTU-ViSYS/M2DGR-plus
Extension and update of M2DGR: a novel Multi-modal and Multi-scenario SLAM Dataset for Ground Robots (ICRA2022 & ICRA2024)
huy-ha/pybullet-blender-recorder
MCG-NJU/SGM-VFI
[CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion
zengxianyu/co-mod-gan-pytorch
PyTorch implementation of the paper "Large Scale Image Completion via Co-Modulated Generative Adversarial Networks"
YihangChen-ee/CNC
🎉 [CVPR 2024] PyTorch implementation of 'How Far Can We Compress Instant-NGP Based NeRF?'
WayneMao/PillarNeSt
The Official Implementation of PillarNeSt
ZHZisZZ/weak-to-strong-search
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
XYPB/CLEFT
Official implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" (MICCAI 2024).
servuskk/ColorDiff-Image
Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
eternalthinker/kami-solver
Solution finder for the KAMI (2) game on iOS/Android
zyayoung/WeChatPersona
LLM agents that mimic your friends.