latent-diffusion

There are 90 repositories under the latent-diffusion topic.

  • InvokeAI

    invoke-ai/InvokeAI

    Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI and serves as the foundation for multiple commercial products.

    Language: TypeScript
  • IOPaint

    Sanster/IOPaint

    Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

    Language: Python
  • yl4579/StyleTTS2

    StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

    Language: Python
  • leejet/stable-diffusion.cpp

    Stable Diffusion and Flux in pure C/C++

    Language: C++
  • discoart

    jina-ai/discoart

    🪩 Create Disco Diffusion artworks in one line

    Language: Python
  • Dreambooth-Stable-Diffusion

    JoePenna/Dreambooth-Stable-Diffusion

    Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

    Language: Jupyter Notebook
  • Stability-AI/stability-sdk

    SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)

    Language: Jupyter Notebook
  • carefree-creator

    carefree0910/carefree-creator

    AI magics meet Infinite draw board.

    Language: Jupyter Notebook
  • lucidrains/naturalspeech2-pytorch

    Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

    Language: Python
  • Uminosachi/sd-webui-inpaint-anything

    Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

    Language: Python
  • anapnoe/stable-diffusion-webui-ux

    Stable Diffusion web UI UX

    Language: Python
  • teticio/audio-diffusion

    Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

    Language: Jupyter Notebook
  • SkyWorkAIGC/SkyPaint-AI-Diffusion

    An optimized text-to-image model based on Stable Diffusion. It accepts both Chinese and English text input and can generate high-quality images in a variety of modern art styles.

  • Text-to-Audio/Make-An-Audio

    PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

    Language: Python
  • DiffusionFastForward

    mikonvergence/DiffusionFastForward

    DiffusionFastForward: a free course and experimental framework for diffusion-based generative models

    Language: Jupyter Notebook
  • nihaomiao/CVPR23_LFDM

    The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

    Language: Python
  • atfortes/Awesome-Controllable-Diffusion

    Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.

  • dailenson/One-DM

    Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation

    Language: Python
  • magnusviri/InvokeAI

    Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI and serves as the foundation for multiple commercial products.

    Language: TypeScript
  • parlance-zz/g-diffuser-bot

    Discord bot and Interface for Stable Diffusion

    Language: Python
  • symisc/tiny-dream

    Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation

    Language: C
  • inpaint-anything

    Uminosachi/inpaint-anything

    Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

    Language: Python
  • cobanov/awesome-diffusion

    A curated list of awesome Diffusion notebooks, tools, software, tutorials and resources.

  • explainingai-code/StableDiffusion-PyTorch

    This repo implements a Stable Diffusion model in PyTorch with all the essential components.

    Language: Python
  • ai-forever/KandinskyVideo

    KandinskyVideo — multilingual end-to-end text2video latent diffusion model

    Language: Python
  • apapiu/transformer_latent_diffusion

    Text to Image Latent Diffusion using a Transformer core

    Language: Python
  • Simple_Prompt_Generator

    WiNE-iNEFF/Simple_Prompt_Generator

    Simple prompt generator for Midjourney, DALL-E, Stable Diffusion, Disco Diffusion, Flux, etc.

    Language: Python
  • steve-zeyu-zhang/MotionMamba

    🔥 [ECCV 2024] Motion Mamba: Efficient and Long Sequence Motion Generation

    Language: JavaScript
  • TokenCompose

    mlpc-ucsd/TokenCompose

    (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision

    Language: Jupyter Notebook
  • BarqueroGerman/BeLFusion

    [ICCV 2023] Official PyTorch Implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction".

    Language: Python
  • kiranchhatre/amuse

    [CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion

    Language: Python
  • WASasquatch/easydiffusion

    Easy Diffusion is an advanced Stable Diffusion Notebook with a feature-rich image processing suite.

    Language: Jupyter Notebook
  • olaviinha/NeuralImageSuperResolution

    Colabs for Neural Image Enhancement.

    Language: Jupyter Notebook
  • kabachuha/InfiNet

    Implementation of the DiffusionOverDiffusion architecture presented in NUWA-XL, in the form of a ControlNet-like module on top of the ModelScope text2video model, for extremely long video generation.

    Language: Python
  • navervision/CompoDiff

    Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)

    Language: Python
  • koninik/WordStylist

    Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023

    Language: Python