image-generation

There are 2204 repositories under the image-generation topic.

  • AUTOMATIC1111/stable-diffusion-webui

    Stable Diffusion web UI

    Language:Python156k1.2k7.8k29k
  • LocalAI

    mudler/LocalAI

    🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

    Language:Go35.3k2321.1k2.8k
  • khoj

    khoj-ai/khoj

    Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

    Language:Python31k1535531.8k
  • huggingface/diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

    Language:Python30.7k2205k6.3k
  • InvokeAI

    invoke-ai/InvokeAI

    Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI and serves as the foundation for multiple commercial products.

    Language:TypeScript25.9k2033.5k2.7k
  • junyanz/pytorch-CycleGAN-and-pix2pix

    Image-to-Image Translation in PyTorch

    Language:Python24.5k3441.5k6.5k
  • Graphite

    GraphiteEditor/Graphite

    An open source graphics editor for 2025: comprehensive 2D content creation tool suite for graphic design, digital art, and interactive real-time motion graphics — featuring node-based procedural editing

    Language:Rust21.1k1251.2k893
  • camenduru/stable-diffusion-webui-colab

    Stable Diffusion web UI Colab

    Language:Jupyter Notebook15.9k1963612.7k
  • junyanz/CycleGAN

    Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

    Language:Lua12.7k3871492k
  • satori

    vercel/satori

    Enlightened library to convert HTML and CSS to SVG

    Language:TypeScript12.4k77315300
  • phillipi/pix2pix

    Image-to-image translation with conditional adversarial nets

    Language:Lua10.5k3192111.7k
  • neural-doodle

    alexjc/neural-doodle

    Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one image to another, perform example-based upscaling, but wait... there's more! (An implementation of Semantic Style Transfer.)

    Language:Python9.9k3160904
  • FoundationVision/VAR

    [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    Language:Jupyter Notebook8.4k104159539
  • dream-textures

    carson-katri/dream-textures

    Stable Diffusion built into Blender

    Language:Python8.1k113551437
  • PaddlePaddle/PaddleGAN

    PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

    Language:Python8.1k1073661.3k
  • jamez-bondos/awesome-gpt4o-images

    Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capabilities.

    Language:JavaScript7.3k1.2k
  • open-mmlab/mmagic

    OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

    Language:Jupyter Notebook7.3k967091.1k
  • OpenGVLab/DragGAN

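    Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment for trial use; code and models are fully open source; supports Windows, macOS, Linux)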

    Language:Python5k65113489
  • pkuliyi2015/multidiffusion-upscaler-for-automatic1111

    Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0

    Language:Python5k49319349
  • StableSwarmUI

    Stability-AI/StableSwarmUI

    StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

    Language:C#4.9k68374395
  • KnpLabs/snappy

    PHP library allowing thumbnail, snapshot or PDF generation from a URL or an HTML page. Wrapper for wkhtmltopdf/wkhtmltoimage

    Language:PHP4.5k131273437
  • leejet/stable-diffusion.cpp

    Diffusion model (SD, Flux, Wan, ...) inference in pure C/C++

    Language:C++4.4k59356420
  • VectorSpaceLab/OmniGen

    OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

    Language:Jupyter Notebook4.3k86168363
  • ali-vilab/AnyDoor

    Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"

    Language:Python4.2k86107367
  • Janspiry/Image-Super-Resolution-via-Iterative-Refinement

    Unofficial implementation of Image Super-Resolution via Iterative Refinement in PyTorch

    Language:Python3.8k65140479
  • Dreambooth-Stable-Diffusion

    JoePenna/Dreambooth-Stable-Diffusion

    Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

    Language:Jupyter Notebook3.2k39109553
  • SwarmUI

    mcmonkeyprojects/SwarmUI

    SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

    Language:C#3.1k25515291
  • crmne/ruby_llm

    One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, GPUStack & OpenAI compatible APIs. Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.

    Language:Ruby2.9k21185254
  • pwa-asset-generator

    elegantapp/pwa-asset-generator

    Automates PWA asset generation and image declaration. Automatically generates icon and splash screen images, favicons and mstile images. Updates manifest.json and index.html files with the generated images according to Web App Manifest specs and Apple Human Interface guidelines.

    Language:TypeScript2.9k19164152
  • ai-forever/Kandinsky-2

    Kandinsky 2 — multilingual text2image latent diffusion model

    Language:Jupyter Notebook2.8k4790312
  • bytedance/InfiniteYou

    🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

    Language:Python2.6k2125131
  • taesungp/contrastive-unpaired-translation

    Contrastive unpaired image-to-image translation, with faster and lighter training than CycleGAN (ECCV 2020, in PyTorch)

    Language:Python2.4k33173435
  • Awesome-Text-to-Image

    Yutong-Zhou-cv/Awesome-Text-to-Image

    (ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

  • lshqqytiger/stable-diffusion-webui-amdgpu

    Stable Diffusion web UI

    Language:Python2.2k29449224
  • Mukosame/Anime2Sketch

    A sketch extractor for anime/illustration.

    Language:Python2.1k2525171
  • ComfyUI-to-Python-Extension

    pydn/ComfyUI-to-Python-Extension

    A powerful tool that translates ComfyUI workflows into executable Python code.

    Language:Python2k1596179