NaohiroTawara's Stars
nttcslab-sp/mamba-diarization
Official repository for Mamba-based Segmentation Model for Speaker Diarization
FrenchKrab/SuperNecoraMario
Super Mario Bros. clone and its level editor, made in C
FrenchKrab/IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
BUTSpeechFIT/EEND
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
liyunlongaaa/NSD-MS2S
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
nestyme/voice-morphing
voice matureness changer based on gender detector & spectral features modification
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
BUTSpeechFIT/DVBx
Discriminative Training of VBx Diarization
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
meta-llama/codellama
Inference code for CodeLlama models
Maokui-He/NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
popcornell/CHiME7DASRDiarizationBaselineJSONs
Baseline diarization system predictions (JSONs and RTTMs) as obtained for the CHiME-7 DASR (Task 1) by me.
dodohow1011/TS-VAD
desh2608/dover-lap
Python package for combining diarization system outputs.
fgnt/padertorch
A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an emphasis on speech processing.
chomeyama/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
hjimce/O2U-Net
paper "O2U-Net: A Simple Noisy Label Detection Approach for Deep Neural Networks" code
QuasarLight/Pytorch_Face_Recognition
Pytorch implementation of mainstream face recognition algorithms(ArcFace, CosFace).
BUTSpeechFIT/VBx
Variational Bayes HMM over x-vectors diarization
pyro-ppl/pyro
Deep universal probabilistic programming with Python and PyTorch
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
manavkaushik/Speech-Analysis-for-Speaker-Characteristics-Estimation
sarulab-speech/jtubespeech
hechmik/voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.