Pinned Repositories
AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
awesome-talking-head-corpus
something about avatar corpus
bert-as-service
Mapping a variable-length sentence to a fixed-length vector using a BERT model
Crawler
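High-performance crawler server for posts, comments, and search on Douyin (latest a_bogus version), Kuaishou, Bilibili, Xiaohongshu, Taobao, JD.com, and Weibo. One-click deployment with Docker. It can retrieve public information from these platforms, but please comply with the disclaimer below.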
d2l_zh_tensorflow2.0
This project ports part of the MXNet code from the book Dive into Deep Learning (《动手学深度学习》) to TensorFlow 2.0.
DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
MegaPortraits
Unofficial Implementation of the paper "MegaPortraits: One-shot Megapixel Neural Head Avatars"
MODA
Official Repository for ICCV-2023 MODA
NLP-BERT--ChineseVersion
Google's natural language processing model BERT: paper walkthrough and Python code
NLP_baselines
some baselines for different tasks
taichuai's Repositories
taichuai/awesome-talking-head-corpus
something about avatar corpus
taichuai/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
taichuai/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
taichuai/ava-256
Train universal codec avatars
taichuai/DH_live
A digital human that everyone can use
taichuai/DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
taichuai/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
taichuai/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
taichuai/EMOPortraits
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
taichuai/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
taichuai/FasterLivePortrait
Bring portraits to life in real time! ONNX/TensorRT support!
taichuai/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
taichuai/FollowYourEmoji
[arXiv 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
taichuai/hallo2
taichuai/hertz-dev
first base model for full-duplex conversational audio
taichuai/JoyVASA
taichuai/Latte
Latte: Latent Diffusion Transformer for Video Generation.
taichuai/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
taichuai/LivePortrait
Make one portrait alive!
taichuai/models
The best OSS video generation models
taichuai/ProbTalk3D
taichuai/rectified-flow
Flow Matching (Rectified Flow) built by hand from scratch
taichuai/robpitch
A pitch detection model trained to be robust against noise and reverberation environments.
taichuai/SAiD
SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion
taichuai/talking_face_preprocessing
Preprocessing Scripts for Talking Face Generation
taichuai/TANGO
Great repo
taichuai/tiny-llm-zh
Implementing a small-parameter Chinese large language model from scratch.
taichuai/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
taichuai/VideoChat
Real-time voice-interactive digital human, supporting an end-to-end voice solution (GLM-4-Voice - THG) and a cascaded solution (ASR-LLM-TTS-THG). Appearance and voice are customizable without training, voice cloning is supported, and first-packet latency is as low as 3 s.
taichuai/virtual_human_stream
The "virtual_human_stream" project is a real-time digital human system supporting audio-video dialogue. It integrates models like ernerf, musetalk, and wav2lip for voice cloning, video stitching, and streaming via RTMP/WebRTC. It’s optimized for high performance and easy customization, with support for ChatGPT dialogue integration.