P3rturbator's Stars
fpgaminer/joycaption
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
felixtaubner/cap4d
Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"
dendenxu/fast-gaussian-rasterization
A geometry-shader-based, global CUDA sorted high-performance 3D Gaussian Splatting rasterizer. Can achieve a 5-10x speedup in rendering compared to the vanialla diff-gaussian-rasterization.
Lightricks/LTX-Video
Official repository for LTX-Video
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
huanngzh/MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
ant-research/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Layer-norm/comfyui-lama-remover
a simple lama remover
adieyal/sd-dynamic-prompts
A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
rhasspy/piper
A fast, local neural text to speech system
facefusion/facefusion
Industry leading face manipulation platform
PowerHouseMan/ComfyUI-AdvancedLivePortrait
volotat/Anagnorisis
Local recommendation system
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
RootKit-Org/AI-Aimbot
World's Best AI Aimbot - CS2, Valorant, Fortnite, APEX, every game
D3voz/joy-caption-alpha-two-gui-mod
joy-caption-alpha-two -cli mod and gui mod
devilismyfriend/StableTuner
Finetuning SD in style.
konstmish/prodigy
The Prodigy optimizer and its variants for training neural networks.
TheJoeFin/Simple-QR-Code-Maker
Generate and Read QR Codes on Windows in this simple elegant app
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
XLabs-AI/x-flux
EnragedAntelope/youtube-screenshot-extractor
Dataset helper for loras or checkpoints! Download YouTube videos, extract highest-available-quality screenshots, auto filter for aesthetics, and more!
xlinx/sd-webui-decadetw-auto-prompt-llm
sd-webui-auto-prompt-llm
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment