taichuai

Focus, focus, focus

Mars

Pinned Repositories

AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
Language:Python
awesome-talking-head-corpus
something about avatar corpus
bert-as-service
Mapping a variable-length sentence to a fixed-length vector using BERT model
Language:Python
Crawler
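High-performance crawler server for posts, comments, and search on Douyin (latest a_bogus version), Kuaishou, Bilibili, Xiaohongshu, Taobao, JD.com, and Weibo. One-click Docker deployment. It can retrieve publicly available information from these platforms, but please comply with the disclaimer below.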
Language:JavaScript
d2l_zh_tensorflow2.0
This project ports part of the MXNet code from the book Dive into Deep Learning (动手学深度学习) to TensorFlow 2.0.
Language:Jupyter Notebook
DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language:Python
MegaPortraits
Unofficial Implementation of the paper "MegaPortraits: One-shot Megapixel Neural Head Avatars"
Language:Python
MODA
Official Repository for ICCV-2023 MODA
Language:Python
NLP-BERT--ChineseVersion
Google's natural language processing model BERT: paper analysis and Python code
Language:Python
NLP_baselines
some baselines for different tasks

taichuai's Repositories

taichuai/awesome-talking-head-corpus
something about avatar corpus
taichuai/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language:Python
taichuai/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook
taichuai/ava-256
Train universal codec avatars
Language:Jupyter Notebook
taichuai/DH_live
A digital human that everyone can use
Language:Python
taichuai/DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
Language:Python
taichuai/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python
taichuai/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
taichuai/EMOPortraits
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
Language:Jupyter Notebook
taichuai/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python
taichuai/FasterLivePortrait
Bring portraits to life in real time! ONNX/TensorRT support!
Language:Python
taichuai/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language:Python
taichuai/FollowYourEmoji
[arXiv 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Language:Python
taichuai/hallo2
taichuai/hertz-dev
first base model for full-duplex conversational audio
taichuai/JoyVASA
Language:Python
taichuai/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python
taichuai/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
Language:Python
taichuai/LivePortrait
Make one portrait alive!
Language:Python
taichuai/models
The best OSS video generation models
taichuai/ProbTalk3D
taichuai/rectified-flow
Building Flow Matching (Rectified Flow) by hand from scratch
taichuai/robpitch
A pitch detection model trained to be robust against noise and reverberation environments.
taichuai/SAiD
SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion
Language:Python
taichuai/talking_face_preprocessing
Preprocessing Scripts for Talking Face Generation
Language:Python
taichuai/TANGO
Great repo
taichuai/tiny-llm-zh
Implementing a small-parameter Chinese large language model from scratch.
taichuai/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
Language:Python
taichuai/VideoChat
Real-time voice-interactive digital human, supporting an end-to-end voice solution (GLM-4-Voice - THG) and a cascaded solution (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3s.
taichuai/virtual_human_stream
The "virtual_human_stream" project is a real-time digital human system supporting audio-video dialogue. It integrates models like ernerf, musetalk, and wav2lip for voice cloning, video stitching, and streaming via RTMP/WebRTC. It’s optimized for high performance and easy customization, with support for ChatGPT dialogue integration.