Pinned Repositories
3DGStream
[CVPR 2024 Highlight] Official repository for the paper "3DGStream: On-the-fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos".
awesome-architecture
架构师技术图谱,助你早日成为架构师
BakedAvatar
Pytorch Code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"
fksm
flume+kafka+storm实时日志分析
FlashAvatar-code
[CVPR 2024] The official repo for FlashAvatar
Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
GaussianTalker
hadoop
spark_redis
spark-redis
StyleAvatar
Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
wuzhongdehua's Repositories
wuzhongdehua/3DGS.cpp
A cross-platform, high performance renderer for Gaussian Splatting using Vulkan Compute. Supports ✅ Windows, Linux, macOS, iOS, and visionOS
wuzhongdehua/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
wuzhongdehua/Controllable-RAG-Agent
This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated graph based algorithm to handle the tasks.
wuzhongdehua/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
wuzhongdehua/depthsplat
DepthSplat: Connecting Gaussian Splatting and Depth
wuzhongdehua/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
wuzhongdehua/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
wuzhongdehua/GAGAvatar
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
wuzhongdehua/gaussian-splatting-lightning
A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer
wuzhongdehua/GaussianAvatars2
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
wuzhongdehua/goliath
Goliath Dataset Release
wuzhongdehua/gs-relight
Official Code Release for SIGGRAPH Asia 2024 Paper: GS^3: Efficient Relighting with Triple Gaussian Splatting
wuzhongdehua/hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
wuzhongdehua/HelloMeme
The official HelloMeme GitHub site
wuzhongdehua/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
wuzhongdehua/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model Training
wuzhongdehua/IDM-VTON
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
wuzhongdehua/LivePortrait
Make one portrait alive!
wuzhongdehua/LiveTalking
Real time interactive streaming digital human
wuzhongdehua/LLaVA-NeXT
wuzhongdehua/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
wuzhongdehua/MimicTalk
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
wuzhongdehua/MVSGaussian
[ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
wuzhongdehua/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
wuzhongdehua/Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端🗝️、Docker-compose 打包部署🐋
wuzhongdehua/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
wuzhongdehua/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
wuzhongdehua/Ultralight-Digital-Human
一个超轻量级、可以在移动端实时运行的数字人模型
wuzhongdehua/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
wuzhongdehua/VideoChat
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.