sunkinux's Stars
shiimizu/ComfyUI-TiledDiffusion
Tiled Diffusion, MultiDiffusion, Mixture of Diffusers, and optimized VAE
ssitu/ComfyUI_UltimateSDUpscale
ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.
kijai/ComfyUI-CCSR
ComfyUI wrapper node for CCSR
florestefano1975/ComfyUI-HiDiffusion
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
unit-mesh/unit-minions
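"AI for R&D Efficiency: Train Your Own LoRA", covering LoRA training for Llama (Alpaca LoRA) and ChatGLM (ChatGLM Tuning) models. Training tasks: user story generation, test code generation, code-assist generation, text-to-SQL, text-to-code…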
hahnec/color-matcher
automatic color-grading
kijai/ComfyUI-KJNodes
Various custom nodes for ComfyUI
ZHO-ZHO-ZHO/ComfyUI-AnyText
Unofficial implementation of AnyText for ComfyUI(EXP)
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
lllyasviel/Omost
Your image is almost there!
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
smthemex/ComfyUI_Llama3_8B
Llama3_8B for ComfyUI, using pipeline workflow
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
twri/sdxl_prompt_styler
Custom prompt styler node for SDXL in ComfyUI
linyiLYi/bilibot
A local chatbot fine-tuned on bilibili user comments.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
mattjaybe/sd-wildcards
A collection of wildcards for Stable Diffusion
encord-team/text-to-image-eval
Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KNN accuracy.
gokayfem/ComfyUI_VLM_nodes
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
TIGER-AI-Lab/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning" (TMLR2024)
gokayfem/awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
TheMistoAI/ComfyUI-Anyline
Anyline: A Fast, Accurate, and Detailed Line Detection Preprocessor
TheMistoAI/MistoLine
A Versatile and Robust SDXL-ControlNet Model for Adaptable Line Art Conditioning
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
bytedance/res-adapter
Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".
Kiteretsu77/APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)