alex4727

Seoul National UniversitySeoul

alex4727's Stars

adobe-research/MagicFixup
Language:Python15110
showlab/DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Language:Python47817
KwaiVGI/LivePortrait
Bring portraits to life!
Language:Python14.2k1.5k
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Language:Python10.8k1.4k
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Language:Python16.5k2.2k
ZiyuGuo99/Image-Generation-CoT
Investigating CoT Reasoning in Autoregressive Image Generation
Language:Python50519
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Language:Jupyter Notebook45516
deepseek-ai/DeepSeek-V3
Language:Python90.2k14.5k
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Language:Python21.8k1.9k
DAMO-NLP-SG/VideoLLaMA3
Frontier Multimodal Foundation Models for Image and Video Understanding
Language:Jupyter Notebook56535
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
Language:Python84540
cure-lab/PnPInversion
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
Language:Jupyter Notebook29213
guanyingc/cv_rebuttal_template
Language:TeX23920
MRzzm/HDTF
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
Language:Python36969
Tencent/Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Language:Python6.6k503
zacharyhorvitz/Fk-Diffusion-Steering
A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.
Language:Jupyter Notebook925
wyhsirius/LIA
[ICLR 22, TPAMI 24] LIA: Latent Image Animator
Language:Python61666
harlanhong/awesome-talking-head-generation
1.6k119
JosephPai/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
1.5k117
Lightricks/LTX-Video
Official repository for LTX-Video
Language:Python2.9k253
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Language:Python1.4k113
brownvc/R3GAN
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
Language:Python67722
abinthomasonline/repo2txt
Web-based tool converts GitHub repository contents into a single formatted text file
Language:JavaScript1.1k121
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Language:Python18.7k1.3k
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)
Language:Jupyter Notebook55341
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Language:Python8.1k616
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Language:Python78843
pkunlp-icler/FastV
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Language:Python36814
dvlab-research/VisionZip
Official repository for VisionZip (CVPR 2025)
Language:Python24110
bytedance/1d-tokenizer
This repo contains the code for 1D tokenizer and generator
Language:Jupyter Notebook69737

alex4727

alex4727's Stars

adobe-research/MagicFixup

showlab/DragAnything

KwaiVGI/LivePortrait

Jiayi-Pan/TinyZero

deepseek-ai/Janus

ZiyuGuo99/Image-Generation-CoT

tgxs002/HPSv2

deepseek-ai/DeepSeek-V3

huggingface/open-r1

DAMO-NLP-SG/VideoLLaMA3

sihyun-yu/REPA

cure-lab/PnPInversion

guanyingc/cv_rebuttal_template

MRzzm/HDTF

Tencent/Hunyuan3D-2

zacharyhorvitz/Fk-Diffusion-Steering

wyhsirius/LIA

harlanhong/awesome-talking-head-generation

JosephPai/Awesome-Talking-Face

Lightricks/LTX-Video

xdit-project/xDiT

brownvc/R3GAN

abinthomasonline/repo2txt

OpenBMB/MiniCPM-o

TIGER-AI-Lab/AnyV2V

microsoft/TRELLIS

Vchitect/VBench

pkunlp-icler/FastV

dvlab-research/VisionZip

bytedance/1d-tokenizer