gt732

gt732's Stars

Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language:Python1.7k138
ryanontheinside/ComfyUI_RyanOnTheInside
Particle systems! Optical flow! Temporal masks! For ComfyUI!
Language:Python1497
MrForExample/ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
Language:Python2.2k223
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
Language:Python89968
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language:Python1.5k116
kijai/ComfyUI-FluxTrainer
Language:Python38917
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python2.5k138
TheMistoAI/MistoControlNet-Flux-dev
ControlNet collections for Flux1-dev model, Trained by TheMisto.ai Team
Language:Python2638
11cafe/comfyui-workspace-manager
A ComfyUI workflows and models management extension to organize and manage all your workflows, models in one place. Seamlessly switch between workflows, as well as import, export workflows, reuse subworkflows, install models, browse your models in a single workspace
Language:TypeScript1.1k49
kijai/ComfyUI-ControlNeXt-SVD
Language:Python1375
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5.1k331
dezi-ai/ComfyUI-AnimateLCM
ComfyUI Custom Node for AnimateLCM
Language:Python1606
nerdyrodent/AVeryComfyNerd
ComfyUI related stuff and things
1.2k93
kijai/ComfyUI-champWrapper
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python20610
lizhe00/AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
Language:Python89659
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.4k849
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
Language:Python55972
henrique-galimberti/i2v-workflow
Image-2-Video Workflow
171
Kosinkadink/ComfyUI-AnimateDiff-Evolved
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
Language:Python2.6k196
Fictiverse/ComfyUI_Fictiverse_Workflows
ComfyUI Workflows
324
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python7.8k731
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.6k301
OpenTalker/SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language:Python11.7k2.2k
unconv/plagiarist-gpt
Fine-tune ChatGPT to write lyrics of your favorite artist
Language:Python82
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python16.4k1.1k
SociallyIneptWeeb/AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
Language:Python1k249
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
Language:HTML938109
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Language:Python1.7k162
KoljaB/RealtimeTTS
Converts text to speech in realtime
Language:Python1.8k161
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
Language:Python2.8k479