laixiao

laixiao's Stars

kleinlee/MiniMates
The fastest digital human algorithm, now on your desktop.
Language: Python 36834
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Language: Python 1.6k 235
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
Language: Python 9.5k 852
ltaoo/wx_channels_download
WeChat Channels video downloader
Language: Go 41355
logtd/ComfyUI-HunyuanLoom
A set of nodes to edit videos using the Hunyuan Video model
Language: Python 1224
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Language: Python 5.7k 357
anliyuan/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
Language: Python 1.4k 201
if-ai/ComfyUI-IF_MemoAvatar
Memory-Guided Diffusion for Expressive Talking Video Generation
Language: Python 1257
kleinlee/DH_live
A digital human that everyone can use
Language: Python 855179
Francis-Rings/StableAnimator
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
Language: Python 1k 47
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Language: Python 1.9k 135
SpenserCai/ComfyUI-FunAudioLLM
ComfyUI custom node for FunAudioLLM, including CosyVoice and SenseVoice
Language: Python 605
memoavatar/memo
Memory-Guided Diffusion for Expressive Talking Video Generation
Language: Python 58756
Comfy-Org/desktop
The desktop app for ComfyUI.
Language: TypeScript 83249
RSSNext/Follow
🧡 Follow your favorites in one inbox
Language: TypeScript 21.4k 887
cooderl/wewe-rss
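🤗 A more elegant way to subscribe to WeChat official accounts, supporting private deployment and RSS generation for WeChat official accounts (based on WeChat Reading), v2.x
Language: TypeScript 5.7k 987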
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python 7.5k 800
TeamWiseFlow/wiseflow
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.
Language: Python 5.4k 946
linyqh/NarratoAI
Use large AI models to narrate and edit videos with a single click.
Language: Python 3.1k 334
zuruoke/watermark-removal
A machine learning image inpainting task that instinctively removes watermarks from images, producing results indistinguishable from the ground-truth image
Language: Python 2.2k 329
yeates/PromptFix
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
Language: Python 68237
KwaiVGI/ComfyUI-KLingAI-API
Language: Python 604
damo-cv/RealisDance
The official implementation of RealisDance
Language: C 27815
Shadownc/cloudflare-mirror-site
Mirror site for Google, GitHub...
Language: JavaScript 73
dairoot/ChatGPT-Mirror
🚀 Deploy your own ChatGPT mirror site with one click
Language: Vue 828158
smthemex/ComfyUI_EchoMimic
Use EchoMimic in ComfyUI
Language: Python 47244
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language: Python 2.1k 242
stackblitz-labs/bolt.diy
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
Language: TypeScript 7.9k 3.7k
Henry-23/VideoChat
Real-time voice-interactive digital human, supporting an end-to-end voice solution (GLM-4-Voice - THG) and a cascaded solution (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3s.
Language: Python 57976
knowsuchagency/pdf-to-podcast
Convert any PDF into a podcast episode!
Language: Python 638265