latent-diffusion
There are 90 repositories under latent-diffusion topic.
invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
JoePenna/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
Stability-AI/stability-sdk
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
carefree0910/carefree-creator
AI magics meet Infinite draw board.
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
anapnoe/stable-diffusion-webui-ux
Stable Diffusion web UI UX
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
SkyWorkAIGC/SkyPaint-AI-Diffusion
基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
mikonvergence/DiffusionFastForward
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
nihaomiao/CVPR23_LFDM
The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
atfortes/Awesome-Controllable-Diffusion
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
dailenson/One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
magnusviri/InvokeAI
About Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
parlance-zz/g-diffuser-bot
Discord bot and Interface for Stable Diffusion
symisc/tiny-dream
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
Uminosachi/inpaint-anything
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
cobanov/awesome-diffusion
A curated list of awesome Diffusion notebooks, tools, software, tutorials and resources.
explainingai-code/StableDiffusion-PyTorch
This repo implements a Stable Diffusion model in PyTorch with all the essential components.
ai-forever/KandinskyVideo
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
apapiu/transformer_latent_diffusion
Text to Image Latent Diffusion using a Transformer core
WiNE-iNEFF/Simple_Prompt_Generator
Simple prompt generator for Midjourney, DALLe, Stable and Disco Diffusion, Flux and etc.
steve-zeyu-zhang/MotionMamba
🔥 [ECCV 2024] Motion Mamba: Efficient and Long Sequence Motion Generation
mlpc-ucsd/TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
BarqueroGerman/BeLFusion
[ICCV2023] Official PyTorch Implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction". ICCV 2023
kiranchhatre/amuse
[CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion
WASasquatch/easydiffusion
Easy Diffusion is an advanced Stable Diffusion Notebook with a feature rich image processing suite.
olaviinha/NeuralImageSuperResolution
Colabs for Neural Image Enhancement.
kabachuha/InfiNet
Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2video model for extremely long video generation.
navervision/CompoDiff
Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)
koninik/WordStylist
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023