fountainbird's Stars
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
logtd/ComfyUI-HunyuanLoom
A set of nodes to edit videos using the Hunyuan Video model
logtd/ComfyUI-LTXTricks
A set of ComfyUI nodes providing additional control for the LTX Video model
huggingface/smol-course
A course on aligning smol models.
akatz-ai/ComfyUI-DepthCrafter-Nodes
A port of tencent/DepthCrafter into ComfyUI
HurroWorld/text-to-audio2face
Web interface to convert text to speech and route it to an Audio2Face streaming player.
kyutai-labs/moshi
akatz-ai/ComfyUI-X-Portrait-Nodes
Wrapper for X-Portrait for running in ComfyUI
molvqingtai/WebChat
💬 Chat with anyone on any website.
kijai/ComfyUI-GIMM-VFI
xg-chu/GAGAvatar
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
stas00/ml-engineering
Machine Learning Engineering Open Book
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
USTC3DV/PortraitGen-code
nasa-jpl/rosa
ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots.
zju3dv/GVHMR
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
jnjaby/KEEP
[ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution
elder-plinius/L1B3RT4S
TOTALLY HARMLESS LIBERATION PROMPTS FOR GOOD LIL AI'S
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
olegchomp/StreamDiffusion-NDI
kijai/comfyui-svd-temporal-controlnet
yangxy/PASD
[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
purzbeats/purz-comfyui-workflows
Purz's ComfyUI Workflows
buaacyw/GaussianEditor
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
cocktailpeanut/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production