isaac-bj's Stars
lihxxx/DisPose
This repository is the official implementation of "DisPose: Disentangling Pose Guidance for Controllable Human Image Animation"
nftblackmagic/catvton-flux
heshengtao/comfyui_LLM_party
LLM Agent Framework in ComfyUI. Includes Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, provides access to Feishu and Discord, and adapts to all LLMs with OpenAI-like or aisuite interfaces, such as o1, ollama, gemini, grok, qwen, GLM, deepseek, moonshot, and doubao. Also adapted to local LLMs, VLMs, and GGUF models such as llama-3.3, with linkage to graphRAG / RAG.
if-ai/ComfyUI-IF_MemoAvatar
Memory-Guided Diffusion for Expressive Talking Video Generation
Vision-CAIR/LongVU
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
luciddreamer-cvlab/LucidDreamer
Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
ali-vilab/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
dockur/windows
Windows inside a Docker container.
kijai/ComfyUI-PyramidFlowWrapper
bytedance/X-Portrait
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
akatz-ai/ComfyUI-X-Portrait-Nodes
Wrapper for X-Portrait for running in ComfyUI
wenqsun/DimensionX
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
hyz317/StdGEN
M4Singer/M4Singer
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team
anliyuan/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
lldacing/ComfyUI_BiRefNet_ll
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
TEN-framework/TEN-Agent
TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatible with popular workflow platforms like Dify and Coze.
smthemex/ComfyUI_InstantIR_Wrapper
You can use InstantIR to upscale images in ComfyUI. InstantIR: Blind Image Restoration with Instant Generative Reference
logtd/ComfyUI-MochiEdit
ComfyUI nodes to edit videos using Genmo Mochi
smthemex/ComfyUI_Sapiens
You can use Sapiens to get seg, normal, pose, depth, and mask
HelloVision/ComfyUI_HelloMeme
Official ComfyUI repository of HelloMeme
kijai/ComfyUI-MochiWrapper
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Jonseed/ComfyUI-Detail-Daemon
A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail.
Hillobar/Rope
GUI-focused roop