RioLLee
student of Northwestern Polytechnical University@Northwestern Polytechnical University
Northwestern Polytechnical University
RioLLee's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
facebookresearch/ConvNeXt
Code release for ConvNeXt model
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
hwanz/SSR-V2ray-Trojan
机场推荐与机场评测
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
krisleech/wisper
A micro library providing Ruby objects with Publish-Subscribe capabilities
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
akashmjn/tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
hitachi-speech/EEND
End-to-End Neural Diarization
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
clovaai/aasist
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
TakHemlata/SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
BUTSpeechFIT/EEND
liyunlongaaa/NSD-MS2S
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture
mogwai/nanodrz
Speaker Diarization with Transformers
TakHemlata/RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".
BUTSpeechFIT/AMI-diarization-setup
BUTSpeechFIT/EEND_dataprep
fgnt/mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
Maokui-He/NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
dihardchallenge/dihard3_baseline
tango4j/llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
GeorgeEfstathiadis/LLM-Diarize-ASR-Agnostic
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
rvarma9604/enc_EEND
Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.