NaohiroTawara

NaohiroTawara's Stars

nttcslab-sp/mamba-diarization
Official repository for Mamba-based Segmentation Model for Speaker Diarization
Language:Python183
FrenchKrab/SuperNecoraMario
Super Mario Bros. clone and its level editor, made in C
Language:C1
FrenchKrab/IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Language:Jupyter Notebook704
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Language:Python213
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language: Python, 14.1k stars, 1.3k forks
BUTSpeechFIT/EEND
Language:Python729
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
Language:Python7714
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language: Python, 2.3k stars, 187 forks
liyunlongaaa/NSD-MS2S
CHiME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with a sequence-to-sequence architecture
Language:Shell644
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook, 36k stars, 4.2k forks
nestyme/voice-morphing
Voice maturity changer based on a gender detector and spectral feature modification
Language:Jupyter Notebook72
Audio-WestlakeU/FS-EEND
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors" (ICASSP 2024).
Language:Python814
BUTSpeechFIT/DVBx
Discriminative Training of VBx Diarization
Language:Python182
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
2153
meta-llama/codellama
Inference code for CodeLlama models
Language: Python, 16k stars, 1.9k forks
Maokui-He/NSD-MA-MSE
A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
Language:Shell442
popcornell/CHiME7DASRDiarizationBaselineJSONs
Baseline diarization system predictions (JSONs and RTTMs) obtained for the CHiME-7 DASR task (Task 1).
2
dodohow1011/TS-VAD
Language:Python437
desh2608/dover-lap
Python package for combining diarization system outputs.
Language:Python7513
fgnt/padertorch
A collection of common functionality to simplify the design, training, and evaluation of machine learning models based on PyTorch, with an emphasis on speech processing.
Language:Python7116
chomeyama/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
Language:Python23334
hjimce/O2U-Net
Code for the paper "O2U-Net: A Simple Noisy Label Detection Approach for Deep Neural Networks"
Language:Python7612
QuasarLight/Pytorch_Face_Recognition
PyTorch implementation of mainstream face recognition algorithms (ArcFace, CosFace).
Language:Python9816
BUTSpeechFIT/VBx
Variational Bayes HMM over x-vectors diarization
Language:Python25257
pyro-ppl/pyro
Deep universal probabilistic programming with Python and PyTorch
Language: Python, 8.6k stars, 987 forks
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, applying transforms for audio augmentation, and training audio classification models with a PyTorch backend.
Language:Python8914
manavkaushik/Speech-Analysis-for-Speaker-Characteristics-Estimation
Language:Python34
sarulab-speech/jtubespeech
Language:Python21346
hechmik/voxceleb_enrichment_age_gender
Code and data repository for the paper "VoxCeleb enrichment for Age and Gender recognition", submitted to ASRU 2021
Language:Jupyter Notebook6215
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Language:Python39374