Pinned Repositories
.vim
vim config for my vim
3DFuse
Official implementation of "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation"
accord-net-extensions
Advanced image processing and computer vision algorithms made as fluent extensions and built for portability
activityrecognition
Information about activity recognition
data-efficient-gans
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
face-detection
My Face Detection application written in Matlab.
KittiSeg
A Kitti Road Segmentation model implemented in tensorflow.
mnn_mtcnn_cpp
mnn based mtcnn c++ realize.
RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
UnderController's Repositories
UnderController/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
UnderController/ailia-models
Pretrained models for ailia SDK
UnderController/BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
UnderController/clarity-upscaler
Clarity-Upscaler: Reimagined image upscaling for everyone
UnderController/ComfyUI-Fluxtapoz
Nodes for image juxtaposition for Flux in ComfyUI
UnderController/ComfyUI-MochiWrapper
UnderController/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
UnderController/EasyControl
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"
UnderController/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
UnderController/Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""
UnderController/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
UnderController/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
UnderController/IDM-VTON
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
UnderController/IDM-VTON-train
UnderController/Kolors
Kolors Team
UnderController/LLM4GEN
UnderController/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
UnderController/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
UnderController/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
UnderController/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level MLLM on Your Phone
UnderController/MoMA
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
UnderController/moondream
tiny vision language model
UnderController/MS-Diffusion
Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
UnderController/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
UnderController/Omost
Your image is almost there!
UnderController/RB-Modulation
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
UnderController/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
UnderController/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
UnderController/TransPixar
UnderController/VMix
Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control