RioLLee

student of Northwestern Polytechnical University@Northwestern Polytechnical University

Northwestern Polytechnical University

RioLLee's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python133k 1.1k 15.9k26.7k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python11.9k 135 6961.3k
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.4k 62 1.1k686
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.1k 72 991764
facebookresearch/ConvNeXt
Code release for ConvNeXt model
Language:Python5.7k 32 130695
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.2k 49 236409
hwanz/SSR-V2ray-Trojan
机场推荐与机场评测
3.7k 52 097
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language:Jupyter Notebook3.5k 43 184301
krisleech/wisper
A micro library providing Ruby objects with Publish-Subscribe capabilities
Language:Ruby3.3k 49 100151
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
Language:Python1.5k 7 71118
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.1k 18 10098
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python595 19 85109
akashmjn/tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
Language:Python434 25 1514
hitachi-speech/EEND
End-to-End Neural Diarization
Language:Python368 17 4657
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
203 11 03
clovaai/aasist
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
Language:Python171 7 940
TakHemlata/SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
Language:Python103 5 525
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Language:Python76 3 114
BUTSpeechFIT/EEND
Language:Python71 8 89
liyunlongaaa/NSD-MS2S
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture
Language:Shell63 3 84
mogwai/nanodrz
Speaker Diarization with Transformers
Language:Jupyter Notebook58 6 31
TakHemlata/RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".
Language:Python50 1 511
BUTSpeechFIT/AMI-diarization-setup
48 5 022
BUTSpeechFIT/EEND_dataprep
Language:Shell47 5 87
fgnt/mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
Language:Python43 6 08
Maokui-He/NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
Language:Shell43 4 22
dihardchallenge/dihard3_baseline
Language:Perl27 3 16
tango4j/llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
Language:Python13 2 11
GeorgeEfstathiadis/LLM-Diarize-ASR-Agnostic
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
Language:Jupyter Notebook10 1 00
rvarma9604/enc_EEND
Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.
Language:Python31

RioLLee

RioLLee's Stars

huggingface/transformers

m-bain/whisperX

modelscope/FunASR

pyannote/pyannote-audio

facebookresearch/ConvNeXt

snakers4/silero-vad

hwanz/SSR-V2ray-Trojan

MahmoudAshraf97/whisper-diarization

krisleech/wisper

facebookresearch/ConvNeXt-V2

modelscope/3D-Speaker

OlaWod/FreeVC

akashmjn/tinydiarize

hitachi-speech/EEND

DongKeon/Awesome-Speaker-Diarization

clovaai/aasist

TakHemlata/SSL_Anti-spoofing

Audio-WestlakeU/FS-EEND

BUTSpeechFIT/EEND

liyunlongaaa/NSD-MS2S

mogwai/nanodrz

TakHemlata/RawBoost-antispoofing

BUTSpeechFIT/AMI-diarization-setup

BUTSpeechFIT/EEND_dataprep

fgnt/mms_msg

Maokui-He/NSD-MA-MSE

dihardchallenge/dihard3_baseline

tango4j/llm_speaker_tagging

GeorgeEfstathiadis/LLM-Diarize-ASR-Agnostic

rvarma9604/enc_EEND