caoyue10
Researcher at Beijing Academy of Artificial Intelligence (BAAI).
Beijing Academy of Artificial Intelligence.
caoyue10's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
lllyasviel/Fooocus
Focus on prompting and generating
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
roboflow/supervision
We write your reusable computer vision tools. 💜
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Stability-AI/StableCascade
Official Code for Stable Cascade
openblocks-dev/openblocks
🔥 🔥 🔥 The Open Source Retool Alternative
pkuliyi2015/multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
openai/weak-to-strong
jasonjmcghee/rem
An open source approach to locally record and enable searching everything you view on your Mac.
openai/consistencydecoder
Consistency Distilled Diff VAE
PRIS-CV/DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
LC1332/Chat-Haruhi-Suzumiya
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
Coyote-A/ultimate-upscale-for-automatic1111
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
KovenYu/WonderJourney
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
mazzzystar/disco-diffusion-wrapper
Implementation of disco-diffusion wrapper that could run on your own GPU with batch text input.
damian0815/compel
A prompting enhancement library for transformers-type text embedding systems
1003715231/gptstore-prompts
Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.
YueWuHKUST/AniPortraitGAN
This is a pytorch implementation of the following paper: AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections, SIGGRAPH Asia 2023.
hzeyuan/OpenGPTS
OpenGPTs- Powerful GPTs Colipot | 强大的gpts浏览器插件|多窗口|批量对话|chatgpt3.5|chatgpt4.0