P3rturbator

P3rturbator's Stars

fpgaminer/joycaption
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
Language:Python2686
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
1.8k76
felixtaubner/cap4d
Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"
1329
dendenxu/fast-gaussian-rasterization
A geometry-shader-based, global CUDA sorted high-performance 3D Gaussian Splatting rasterizer. Can achieve a 5-10x speedup in rendering compared to the vanialla diff-gaussian-rasterization.
Language:Python50323
Lightricks/LTX-Video
Official repository for LTX-Video
Language:Python2.6k207
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Language:Python6.7k457
huanngzh/MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
Language:Python52131
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Language:Jupyter Notebook3.4k275
ant-research/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language:Python2.6k248
Layer-norm/comfyui-lama-remover
a simple lama remover
Language:Python10812
adieyal/sd-dynamic-prompts
A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
Language:Python2.1k272
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python74.3k8.9k
rhasspy/piper
A fast, local neural text to speech system
Language:C++7.4k546
facefusion/facefusion
Industry leading face manipulation platform
Language:Python21k3.2k
PowerHouseMan/ComfyUI-AdvancedLivePortrait
Language:Python2.1k178
volotat/Anagnorisis
Local recommendation system
Language:JavaScript9411
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Language:Python3.8k428
RootKit-Org/AI-Aimbot
World's Best AI Aimbot - CS2, Valorant, Fortnite, APEX, every game
Language:Python1.5k299
D3voz/joy-caption-alpha-two-gui-mod
joy-caption-alpha-two -cli mod and gui mod
Language:Python583
devilismyfriend/StableTuner
Finetuning SD in style.
Language:Python67352
konstmish/prodigy
The Prodigy optimizer and its variants for training neural networks.
Language:Python36423
TheJoeFin/Simple-QR-Code-Maker
Generate and Read QR Codes on Windows in this simple elegant app
Language:C#645
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Language:Python20.2k2.1k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36.2k4.2k
XLabs-AI/x-flux
Language:Python1.8k128
EnragedAntelope/youtube-screenshot-extractor
Dataset helper for loras or checkpoints! Download YouTube videos, extract highest-available-quality screenshots, auto filter for aesthetics, and more!
Language:Python243
xlinx/sd-webui-decadetw-auto-prompt-llm
sd-webui-auto-prompt-llm
Language:Python578
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
Language:Python1.7k169
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python4.2k256
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Language:Python3k210