Shirley-0708

SCUT

Shirley-0708's Stars

chatchat-space/Langchain-Chatchat
Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Language:TypeScript33.1k 288 4k5.7k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.8k 425 4.2k6.4k
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.5k 694 1.7k5.3k
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python7.5k 57 1951.2k
InternLM/InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Language:Python6.7k 56 343477
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.4k 61 241164
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1.4k 35 728251
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language:Jupyter Notebook1.2k 17 137221
PantoMatrix/PantoMatrix
PantoMatrix: Generating Face and Body Animation from Speech
Language:Python936 54 178182
Rubikplayer/flame-fitting
Example code for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 3D keypoints and 3D scans.
Language:Python750 25 58112
MPI-IS/mesh
MPI-IS Mesh Processing Library
Language:Python683 17 103156
TimoBolkart/TF_FLAME
Tensorflow framework for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 2D or 3D keypoints, and how to generate textured head meshes from Images.
Language:Python457 22 6578
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
Language:Python403 8 65103
MRzzm/HDTF
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
Language:Python362 14 2167
psyai-net/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
Language:Python362 12 3134
coletdjnz/yt-dlp-youtube-oauth2
[OBSOLETE] Plugin that adds OAuth2 login support to yt-dlp's YouTube extractors
Language:Python255 6 4436
GuoCoder/ai-app
本项目旨在分享人工智能相关应用技术以及实战经验，包括大模型、语音合成、数字人、图像生成等。
Language:Python160 5 236
psyai-net/SelfTalk_release
This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces""
Language:MATLAB135 8 1015
rupakvignesh/Lyrics-to-Audio-Alignment
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structures and then uses hidden markov models to obtain alignment within segments. The final alignment is concatenation of time stamps of lyrics within the segments for each song.
Language:Python90 6 822
f90/jamendolyrics
Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
Language:Python80 9 410
uuembodiedsocialai/ProbTalk3D
Language:Python71 5 410
jhuang448/LyricsAlignment-MTL
Language:Python57 4 312
SwagLyrics/autosynch
Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.
Language:Python50 6 69
CvHadesSun/FLame2SMPLX
A tool to tranform the flame texture space,shape and pose paramerter into SMPL or SMPLX model 's head(or face).
Language:Python38 3 42
schufo/lyrics-aligner
Automatic lyrics alignment at phoneme or word level with a pre-trained deep neural network.
Language:Python31 1 34
MediaBrain-SJTU/JRTransformer
[ICCV2023] Joint-Relation Transformer for Multi-Person Motion Prediction
Language:Python25 1 33
york135/MIRMLPop
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics and Audio," ICASSP 2024.
Language:Python23 1 02
ras0k/auto-lyrics
Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp
Language:Jupyter Notebook18 1 14
laurenceyoon/real-time-lyrics-alignment
Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024
Language:Python12 1 01
MusicGeneration/SongMASS
Language:HTML41

Shirley-0708

Shirley-0708's Stars

chatchat-space/Langchain-Chatchat

facebookresearch/fairseq

kaldi-asr/kaldi

facebookresearch/mae

InternLM/InternLM

ZiqiaoPeng/SyncTalk

MontrealCorpusTools/Montreal-Forced-Aligner

YuanGongND/ast

PantoMatrix/PantoMatrix

Rubikplayer/flame-fitting

MPI-IS/mesh

TimoBolkart/TF_FLAME

mpc001/Lipreading_using_Temporal_Convolutional_Networks

MRzzm/HDTF

psyai-net/EmoTalk_release

coletdjnz/yt-dlp-youtube-oauth2

GuoCoder/ai-app

psyai-net/SelfTalk_release

rupakvignesh/Lyrics-to-Audio-Alignment

f90/jamendolyrics

uuembodiedsocialai/ProbTalk3D

jhuang448/LyricsAlignment-MTL

SwagLyrics/autosynch

CvHadesSun/FLame2SMPLX

schufo/lyrics-aligner

MediaBrain-SJTU/JRTransformer

york135/MIRMLPop

ras0k/auto-lyrics

laurenceyoon/real-time-lyrics-alignment

MusicGeneration/SongMASS