gt732's Stars
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
ryanontheinside/ComfyUI_RyanOnTheInside
Particle systems! Optical flow! Temporal masks! For ComfyUI!
MrForExample/ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc.) using cutting-edge algorithms (3DGS, NeRF, etc.)
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
kijai/ComfyUI-FluxTrainer
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
TheMistoAI/MistoControlNet-Flux-dev
ControlNet collections for Flux1-dev model, Trained by TheMisto.ai Team
11cafe/comfyui-workspace-manager
A ComfyUI workflow and model management extension to organize and manage all your workflows and models in one place. Seamlessly switch between workflows, import and export workflows, reuse subworkflows, install models, and browse your models in a single workspace.
kijai/ComfyUI-ControlNeXt-SVD
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
dezi-ai/ComfyUI-AnimateLCM
ComfyUI Custom Node for AnimateLCM
nerdyrodent/AVeryComfyNerd
ComfyUI related stuff and things
kijai/ComfyUI-champWrapper
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
lizhe00/AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
guoyww/AnimateDiff
Official implementation of AnimateDiff.
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
henrique-galimberti/i2v-workflow
Image-2-Video Workflow
Kosinkadink/ComfyUI-AnimateDiff-Evolved
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
Fictiverse/ComfyUI_Fictiverse_Workflows
ComfyUI Workflows
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
unconv/plagiarist-gpt
Fine-tune ChatGPT to write lyrics of your favorite artist
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
SociallyIneptWeeb/AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, and wav file maintenance. It can also be used with third-party software via JSON calls.
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
KoljaB/RealtimeTTS
Converts text to speech in realtime
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.