Pinned Repositories
ACF-BCKD
A pytorch implenment for Attention Correction Feature and Boundary Constraint Knowledge Distillation for Efficient Medical Image Segmentation
AdaIN-Style-Transfer-PyTorch
A Pytorch implementation of Style Transfer with Adaptive Instance Normalization based on the paper Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization.
Adaptive-active-contour-model-driven-by-global-and-local-intensity
This my implementation of the paper Adaptive active contour model driven by global and local intensity fitting energy for image segmentation
AI-KD
Official repository of "AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation" presented at IWBF 2024.
AIGC-Digital-Human
Collections of papers, databases, and codes targeted at Digital Human
AnimateAnyone-SVD
AnimateAnyone_unofficial
Unofficial implementation of Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
LoraControlnet
songyang86's Repositories
songyang86/blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
songyang86/Celeb_reid
换装行人重识别(celeb_reid),可实现对特定人员的重识别检测
songyang86/ChronoMagic-Bench
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
songyang86/DiffIR2VR-Zero
songyang86/Diffusion-Mamba
Implementation of Diffusion Mamba
songyang86/DiLightNet
Official Code Release for [SIGGRAPH 2024] DilightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
songyang86/DM-VTON
👗 DM-VTON: Distilled Mobile Real-time Virtual Try-On
songyang86/EAT_code
Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
songyang86/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training
songyang86/GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
songyang86/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
songyang86/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
songyang86/LaVie
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
songyang86/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
songyang86/Person_reID_baseline_pytorch
:bouncing_ball_person: Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
songyang86/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
songyang86/ReVersion
ReVersion: Diffusion-Based Relation Inversion from Images
songyang86/segmentation-guided-diffusion
[MICCAI 2024] Easy diffusion models with precise segmentation mask guidance.
songyang86/SelfReg-UNet
Code for the paper "SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation "
songyang86/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
songyang86/TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
songyang86/Tora
Official repo for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
songyang86/TrackDiffusion
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
songyang86/Trajectory-Long-tail-Distribution-for-MOT
⭕️ Official codes for "Delving into the Trajectory Long-tail Distribution for Muti-object Tracking" (CVPR2024)
songyang86/VADER
songyang86/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
songyang86/Video-Infinity
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
songyang86/VideoTetris
VideoTetris: Towards Compositional Text-To-Video Generation
songyang86/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
songyang86/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)