facok

facok's Stars

jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并支持api调用
Language:Python10.2k1.1k
KohakuBlueleaf/z-tipo-extension
A sd-webui extension for utilizing DanTagGen to "upsample prompts".
Language:Python21415
AIDC-AI/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Language:Python34718
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Language:Python12k2k
dioxic/image-cropper
Language:Python4
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Language:Jupyter Notebook7.9k837
google/RB-Modulation
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
Language:Jupyter Notebook32227
deepghs/imgutils
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
Language:Python17614
kijai/ComfyUI-FluxTrainer
Language:Python38317
linoytsaban/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python3
XLabs-AI/x-flux
Language:Python1.4k101
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python14.3k1k
HighCWu/control-lora-v3
ControlLoRA Version 3: LoRA Is All You Need to Control the Spatial Information of Stable Diffusion.
Language:Python17
lrzjason/Comfyui-Kolors-Utils
Utils for kolors
Language:Python181
OpenGVLab/OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Language:Python2525
lrzjason/T2ITrainer
Practice Code for text to image trainer
Language:Python623
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Language:Python6.4k575
facok/florence2-ft-simple
finetune your florence2 model easy
Language:Python112
metercai/SimpleSDXL
Enhanced version of Fooocus for SDXL, more suitable for Chinese and Cloud
Language:Python57927
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python31.8k3.9k
Kwai-Kolors/Kolors
Kolors Team
Language:Python3.7k239
fishaudio/fish-speech
Brand new TTS solution
Language:Python12.8k958
facok/GPT-for-Annotation
Eagle Plugin
Language:Python181
KohakuBlueleaf/LyCORIS
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
Language:Python2.2k146
andimarafioti/florence2-finetuning
Quick exploration into fine tuning florence 2
Language:Jupyter Notebook25724
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python1.7k112
roboflow/supervision
We write your reusable computer vision tools. 💜
Language:Python22.7k1.7k
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Language:Python1.6k143
bdashore3/flash-attention
Fast and memory-efficient exact attention
Language:Python23320
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python25.4k5.3k