AK391's Stars
lllyasviel/ControlNet
Let us control diffusion models!
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
deep-floyd/IF
openai/point-e
Point cloud diffusion for 3D model synthesis
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
ali-vilab/composer
Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"
Shiriluz/Word-As-Image
pix2pixzero/pix2pix-zero
Zero-shot Image-to-Image Translation [SIGGRAPH 2023]
microsoft/MM-REACT
Official repo for MM-REACT
MichalGeyer/plug-and-play
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
KU-CVLAB/3DFuse
Official implementation of "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation"
csyxwei/ELITE
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
G-U-N/AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
kohjingyu/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
jwkirchenbauer/lm-watermarking
mkshing/svdiff-pytorch
Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"
dvlab-research/Video-P2P
Video-P2P: Video Editing with Cross-attention Control
bryandlee/Tune-A-Video
Unofficial implementation of Tune-A-Video
arXiv/arxiv-browse
Flask app for article abstract and listing pages
orpatashnik/local-prompt-mixing
allenai/WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
vzakharov/jukebox-webui
Google Colab-backed Web UI for creating music with OpenAI Jukebox
Expedit-LargeScale-Vision-Transformer/Expedit-SAM
[NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning" for SAM.
da03/markup2im
Diffusion-based markup-to-image generation
nitrosocke/diffusers-webui
This is a Gradio WebUI working with the Diffusers format of Stable Diffusion