luosiwu

luosiwu's Stars

Francis-Rings/StableAvatar
We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a reference image and audio.
Language:Python99281
chipsalliance/verible
Verible is a suite of SystemVerilog developer tools, including a parser, style-linter, formatter and language server
Language:C++1.6k250
suoto/hdl_checker
Repurposing existing HDL tools to help writing better code
Language:Python21725
MikePopoloski/slang
SystemVerilog compiler and language services
Language:C++834172
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python8.6k1.2k
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python23k1.9k
wangzhaode/mnn-llm
llm deploy project based mnn. This project has merged into MNN.
Language:C++1.6k177
antonibigata/keysync
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Language:Jupyter Notebook35936
bytedance/MegaTTS3
Language:Python5.9k469
antgroup/ditto-talkinghead
[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
Language:Python47485
warmshao/FasterLivePortrait
Bring portraits to life in Real Time！onnx/tensorrt support！实时肖像驱动！
Language:Python97093
KwaiVGI/LivePortrait
Bring portraits to life!
Language:Python17.1k1.8k
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/README.md). MNN TaoAvatar Android - Local 3D Avatar Intelligence: apps/Android/Mnn3dAvatar/README.md
Language:C++13.1k2.1k
OpenTalker/ToonTalker
[ICCV 2023]ToonTalker: Cross-Domain Face Reenactment
Language:Python12215
harlanhong/ICCV2023-MCNET
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
Language:Python25324
Wan-Video/Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
Language:Python14.1k1.9k
Fantasy-AMAP/fantasy-talking
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Language:Python1.6k123
Holasyb918/HeyGem-Linux-Python-Hack
A docker free offline version for HeyGem; Python and Linux is all you need!
Language:Python33581
antgroup/echomimic_v2
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language:Python4.3k502
megvii-research/megactor
Language:Python897122
github/github-mcp-server
GitHub's official MCP Server
Language:Go22.7k2.5k
jixiaozhong/Sonic
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Language:Python3k259
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python88.7k9.9k
78/xiaozhi-esp32
An MCP-based chatbot | 一个基于MCP的聊天机器人
Language:C++18.6k3.8k
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python6.6k604
xinnan-tech/xiaozhi-esp32-server
本项目为xiaozhi-esp32提供后端服务，帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
Language:Python6.7k2.3k
om-ai-lab/VLM-R1
Solve Visual Understanding with Reinforced VLMs
Language:Python5.5k351
nadermx/backgroundremover
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.
Language:Python7.5k621
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Language:Python15.1k1.1k
deepseek-ai/DeepSeek-V3
Language:Python99.3k16.2k