luosiwu's Stars
Francis-Rings/StableAvatar
We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a reference image and audio.
chipsalliance/verible
Verible is a suite of SystemVerilog developer tools, including a parser, style-linter, formatter and language server
suoto/hdl_checker
Repurposing existing HDL tools to help writing better code
MikePopoloski/slang
SystemVerilog compiler and language services
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
fishaudio/fish-speech
SOTA Open Source TTS
wangzhaode/mnn-llm
llm deploy project based mnn. This project has merged into MNN.
antonibigata/keysync
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
bytedance/MegaTTS3
antgroup/ditto-talkinghead
[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
warmshao/FasterLivePortrait
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
KwaiVGI/LivePortrait
Bring portraits to life!
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/README.md). MNN TaoAvatar Android - Local 3D Avatar Intelligence: apps/Android/Mnn3dAvatar/README.md
OpenTalker/ToonTalker
[ICCV 2023]ToonTalker: Cross-Domain Face Reenactment
harlanhong/ICCV2023-MCNET
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
Wan-Video/Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
Fantasy-AMAP/fantasy-talking
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Holasyb918/HeyGem-Linux-Python-Hack
A docker free offline version for HeyGem; Python and Linux is all you need!
antgroup/echomimic_v2
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
megvii-research/megactor
github/github-mcp-server
GitHub's official MCP Server
jixiaozhong/Sonic
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
78/xiaozhi-esp32
An MCP-based chatbot | 一个基于MCP的聊天机器人
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
xinnan-tech/xiaozhi-esp32-server
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
om-ai-lab/VLM-R1
Solve Visual Understanding with Reinforced VLMs
nadermx/backgroundremover
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
deepseek-ai/DeepSeek-V3