laixiao's Stars
kleinlee/MiniMates
The fastest digital human algorithm, now on your desktop.
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
ltaoo/wx_channels_download
微信视频号下载器
logtd/ComfyUI-HunyuanLoom
A set of nodes to edit videos using the Hunyuan Video model
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
anliyuan/Ultralight-Digital-Human
一个超轻量级、可以在移动端实时运行的数字人模型
if-ai/ComfyUI-IF_MemoAvatar
Memory-Guided Diffusion for Expressive Talking Video Generation
kleinlee/DH_live
每个人都能用的数字人
Francis-Rings/StableAnimator
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
SpenserCai/ComfyUI-FunAudioLLM
Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice
memoavatar/memo
Memory-Guided Diffusion for Expressive Talking Video Generation
Comfy-Org/desktop
The desktop app for ComfyUI.
RSSNext/Follow
🧡 Follow your favorites in one inbox
cooderl/wewe-rss
🤗更优雅的微信公众号订阅方式,支持私有化部署、微信公众号RSS生成(基于微信读书)v2.x
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
TeamWiseFlow/wiseflow
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.
linyqh/NarratoAI
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
zuruoke/watermark-removal
a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image
yeates/PromptFix
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
KwaiVGI/ComfyUI-KLingAI-API
damo-cv/RealisDance
The official implementation of RealisDance
Shadownc/cloudflare-mirror-site
google、github...镜像站
dairoot/ChatGPT-Mirror
🚀 一键部署个人的 ChatGPT 镜像站
smthemex/ComfyUI_EchoMimic
You can using EchoMimic in ComfyUI
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
stackblitz-labs/bolt.diy
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
Henry-23/VideoChat
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
knowsuchagency/pdf-to-podcast
Convert any PDF into a podcast episode!