Shirley-0708's Stars
chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
InternLM/InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
PantoMatrix/PantoMatrix
PantoMatrix: Generating Face and Body Animation from Speech
Rubikplayer/flame-fitting
Example code for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 3D keypoints and 3D scans.
MPI-IS/mesh
MPI-IS Mesh Processing Library
TimoBolkart/TF_FLAME
TensorFlow framework for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 2D or 3D keypoints, and generate textured head meshes from images.
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
MRzzm/HDTF
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
psyai-net/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
coletdjnz/yt-dlp-youtube-oauth2
[OBSOLETE] Plugin that adds OAuth2 login support to yt-dlp's YouTube extractors
GuoCoder/ai-app
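This project aims to share AI application techniques and hands-on experience, including large models, speech synthesis, digital humans, and image generation.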
psyai-net/SelfTalk_release
This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces"
rupakvignesh/Lyrics-to-Audio-Alignment
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to divide the audio into structural segments, then uses hidden Markov models to obtain the alignment within each segment. The final alignment is the concatenation of the lyric time stamps within the segments of each song.
f90/jamendolyrics
Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
uuembodiedsocialai/ProbTalk3D
jhuang448/LyricsAlignment-MTL
SwagLyrics/autosynch
Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.
CvHadesSun/FLame2SMPLX
A tool to transform the FLAME texture space, shape, and pose parameters into the head (or face) of an SMPL or SMPLX model.
schufo/lyrics-aligner
Automatic lyrics alignment at phoneme or word level with a pre-trained deep neural network.
MediaBrain-SJTU/JRTransformer
[ICCV2023] Joint-Relation Transformer for Multi-Person Motion Prediction
york135/MIRMLPop
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics and Audio," ICASSP 2024.
ras0k/auto-lyrics
Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp
laurenceyoon/real-time-lyrics-alignment
Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024
MusicGeneration/SongMASS