Pinned Repositories
comfyui-llm-toolkit
ComfyUI-IF_AI_tools
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.
ComfyUI-IF_AI_WishperSpeechNode
A convenient fast Text to Speech Whisper Speech by Collabora you can train a voice on the fly on ComfyUI
ComfyUI-IF_Gemini
New nANO-Banana Google Gemini API for ComfyUI generate images, transcribe audio, sumarize videos. Making a separate implemetation of my old IF_AI tools for easy installation
ComfyUI-IF_LLM
Run Local and API LLMs, Features Gemini2 image generation, DEEPSEEK R1, QwenVL2.5, QWQ32B, Ollama, LlamaCPP LMstudio, Koboldcpp, TextGen, Transformers or via APIs Anthropic, Groq, OpenAI, Google Gemini, Mistral, xAI and create your own charcters assistants (SystemPrompts) with custom presets
ComfyUI-IF_MemoAvatar
Memory-Guided Diffusion for Expressive Talking Video Generation
ComfyUI-IF_Trellis
ComfyUI TRELLIS is a large 3D asset generation in various formats, such as Radiance Fields, 3D Gaussians, and meshes. The cornerstone of TRELLIS is a unified Structured LATent (SLAT) representation that allows decoding to different output formats and Rectified Flow Transformers tailored for SLAT as the powerful backbones.
ComfyUI-IF_VideoPrompts
Creates prompts for Video Models by sequence analysis and prompting using Qwen2.5-VL models from Alibaba.
ComfyUI_HunyuanVideoFoley
HunyuanVideoFoley generates SFX audio to match your video and text prompt
IF_prompt_MKR
An A1111 extension to let the AI make prompts for SD using Oobabooga
if-ai's Repositories
if-ai/moondream
tiny vision language model
if-ai/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
if-ai/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
if-ai/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
if-ai/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
if-ai/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
if-ai/comfyui-deploy-next-example
A demo for running comfy deploy api via nextjs
if-ai/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
if-ai/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
if-ai/Friend
AI wearable with 24h+ battery
if-ai/LaVague
Automate automation with Large Action Model framework
if-ai/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
if-ai/StreamMultiDiffusion
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
if-ai/InTeX
Interactive Text-to-Texture Synthesis via Unified Depth-aware Inpainting.
if-ai/ComfyUI-Flowty-CRM
This is a custom node that lets you use Convolutional Reconstruction Models right from ComfyUI.
if-ai/ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
if-ai/CRM
Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
if-ai/GaLore
if-ai/litegraph.js
A graph node engine and editor written in Javascript similar to PD or UDK Blueprints, comes with its own editor in HTML5 Canvas2D. The engine can run client side or server side using Node. It allows to export graphs as JSONs to be included in applications independently.
if-ai/plock
From anywhere you can type, query and stream the output of an LLM or any other script
if-ai/ComfyUI_Custom_Nodes_AlekPet
Custom nodes that extend the capabilities of Comfyui
if-ai/IF_prompt_MKR
An A1111 extension to let the AI make prompts for SD using Oobabooga
if-ai/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
if-ai/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild
if-ai/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
if-ai/Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
if-ai/ChatdollKit
ChatdollKit enables you to make your 3D model into a chatbot
if-ai/ComfyUI-Iterative-Mixer
Nodes that implement iterative mixing of samples to help with upscaling quality
if-ai/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
if-ai/ComfyUI-Inspire-Pack
This repository offers various extension nodes for ComfyUI. Nodes here have different characteristics compared to those in the ComfyUI Impact Pack. The Impact Pack has become too large now...