wendashi's Stars
Xiaojiu-z/SSR_Encoder
PyTorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024)
pcottle/learnGitBranching
An interactive git visualization and tutorial. Aspiring students of git can use this app to educate and challenge themselves towards mastery of git!
Yuanshi9815/Subjects200K
Subjects200K dataset
Yuanshi9815/OminiControl
A minimal and universal controller for FLUX.1.
alimama-creative/FLUX-Controlnet-Inpainting
instantX-research/Regional-Prompting-FLUX
Training-free Regional Prompting for Diffusion Transformers 🔥
ali-vilab/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
openai/consistency_models
Official repo for consistency models.
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
salesforce/UniControl
Unified Controllable Visual Generation Model
logtd/ComfyUI-Fluxtapoz
Nodes for image juxtaposition for Flux in ComfyUI
Shenyi-Z/ToCa
Accelerating Diffusion Transformers with Token-wise Feature Caching
bytedance/TextHarmony
The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation
EnVision-Research/OmniBooth
wdndev/mllm_interview_note
Notes mainly covering multimodal knowledge for large language model (LLM) algorithm (application) engineers
wdndev/llm_interview_note
Notes mainly covering knowledge and interview questions for large language model (LLM) algorithm (application) engineers
LituRout/RF-Inversion
Rectified Flow Inversion (RF-Inversion)
sihyun-yu/REPA
Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
zyxElsa/ProSpect
Official implementation of the paper "ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models" (SIGGRAPH Asia 2023)
zyxElsa/InST
Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
YuxinWenRick/hard-prompts-made-easy
mlfoundations/open_clip
An open source implementation of CLIP.
oss-roettger/T5-Textual-Inversion
Textual Inversion for DeepFloyd IF
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
ShareGPT4Omni/ShareGPT4V
[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
aadhithya/onnx-typecast
Script to typecast ONNX model parameters from INT64 to INT32.
cardinalblue/ArtAdapter
Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation
GongyeLiu/StyleCrafter
[SIGGRAPH Asia 2024 (Journal Track)] StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter