facok's Stars
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并支持api调用
KohakuBlueleaf/z-tipo-extension
A sd-webui extension for utilizing DanTagGen to "upsample prompts".
AIDC-AI/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
dioxic/image-cropper
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
google/RB-Modulation
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
deepghs/imgutils
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
kijai/ComfyUI-FluxTrainer
linoytsaban/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
XLabs-AI/x-flux
black-forest-labs/flux
Official inference repo for FLUX.1 models
HighCWu/control-lora-v3
ControlLoRA Version 3: LoRA Is All You Need to Control the Spatial Information of Stable Diffusion.
lrzjason/Comfyui-Kolors-Utils
Utils for kolors
OpenGVLab/OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
lrzjason/T2ITrainer
Practice Code for text to image trainer
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
facok/florence2-ft-simple
finetune your florence2 model easy
metercai/SimpleSDXL
Enhanced version of Fooocus for SDXL, more suitable for Chinese and Cloud
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Kwai-Kolors/Kolors
Kolors Team
fishaudio/fish-speech
Brand new TTS solution
facok/GPT-for-Annotation
Eagle Plugin
KohakuBlueleaf/LyCORIS
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
andimarafioti/florence2-finetuning
Quick exploration into fine tuning florence 2
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
roboflow/supervision
We write your reusable computer vision tools. 💜
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
bdashore3/flash-attention
Fast and memory-efficient exact attention
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.