yakunpku

SenseTimeBeijing, China

yakunpku's Stars

wangjiangshan0725/RF-Solver-Edit
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
Language:Python1393
DmitryUlyanov/deep-image-prior
Image restoration with neural networks but without learning.
Language:Jupyter Notebook7.9k1.4k
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language:Python43229
HelloVision/ComfyUI_HelloMeme
Official comfyui repository of Hellomeme
Language:Python885
HelloVision/HelloMeme
The official HelloMeme GitHub site
Language:Python1407
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)
Language:Jupyter Notebook50738
instantX-research/Regional-Prompting-FLUX
Training-free Regional Prompting for Diffusion Transformers 🔥
Language:Python34312
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language:Python3.9k614
kleinlee/MiniMates
The fastest digital human algorithm, now on your desktop.
Language:Python28324
alimama-creative/FLUX-Controlnet-Inpainting
Language:Python39329
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.5k250
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Jupyter Notebook7.6k571
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python35.4k4.3k
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
Language:Python2.3k181
TachibanaYoshino/AnimeGAN
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to transform real-world photos into anime images.
Language:Python4.5k663
kyutai-labs/moshi
Language:Python6.7k523
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Language:Jupyter Notebook2.6k197
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Language:Python14.7k2.2k
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Language:Python1.6k185
OpenGVLab/MUTR
[AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation
Language:Python695
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Language:Python64730
conradry/copy-paste-aug
Copy-paste augmentation for segmentation and detection tasks
Language:Jupyter Notebook54673
OpenDriveLab/Vista
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Language:Python56542
zchuz/CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
33312
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python9.1k856
Atten4Vis/CAE
This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"
Language:Python908
MingXiangL/DEVIL
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].
Language:Python33643
HyperGAI/HPT
HPT - Open Multimodal LLMs from HyperGAI
Language:Python31318
xinntao/ESRGAN
ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.
Language:Python6k1.1k
xinntao/facexlib
FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.
Language:Python845148

yakunpku

yakunpku's Stars

wangjiangshan0725/RF-Solver-Edit

DmitryUlyanov/deep-image-prior

FireRedTeam/FireRedTTS

HelloVision/ComfyUI_HelloMeme

HelloVision/HelloMeme

TIGER-AI-Lab/AnyV2V

instantX-research/Regional-Prompting-FLUX

yisol/IDM-VTON

kleinlee/MiniMates

alimama-creative/FLUX-Controlnet-Inpainting

facebookresearch/sapiens

open-mmlab/Amphion

coqui-ai/TTS

THUDM/GLM-4-Voice

TachibanaYoshino/AnimeGAN

kyutai-labs/moshi

VectorSpaceLab/OmniGen

serengil/deepface

gpt-omni/mini-omni2

OpenGVLab/MUTR

sihyun-yu/REPA

conradry/copy-paste-aug

OpenDriveLab/Vista

zchuz/CoT-Reasoning-Survey

THUDM/CogVideo

Atten4Vis/CAE

MingXiangL/DEVIL

HyperGAI/HPT

xinntao/ESRGAN

xinntao/facexlib