A curated list of awesome Voiceprint Recognition papers.
- Adaptive Margin Circle Loss for Speaker Verification
- Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances
- Joint Gender and Age Estimation Based on Speech Signals Using x-vectors and Transfer Learning
- EfficientTDNN: Efficient Architecture Search for Speaker Recognition in the Wild
- Out of a hundred trials, how many errors does your speaker verifier make?
- Unit selection synthesis based data augmentation for fixed phrase speaker verification
- Learnable MFCCs for Speaker Verification
- Triplet loss based embeddings for forensic speaker identification in Spanish
- Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
- ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification | code | code
- In defence of metric learning for speaker recognition | code
- Augmentation adversarial training for unsupervised speaker recognition
- Double Multi-Head Attention for Speaker Verification
- UIAI System for Short-Duration Speaker Verification Challenge 2020
- Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System
- Probabilistic embeddings for speaker diarization
- Deep Normalization for Speaker Vectors
- Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification
- Learning to fool the speaker recognition
- Universal Adversarial Perturbations Generative Network for Speaker Recognition
- Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification
- AM-MobileNet1D: A Portable Model for Speaker Recognition
- End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification
- NPLDA: A Deep Neural PLDA Model for Speaker Verification
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition | code
- SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
- MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
- Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition
- Multichannel CRNN for Speaker Counting: an Analysis of Performance
- End-to-End Speaker Height and Age Estimation using Attention Mechanism with LSTM-RNN
- SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification
- Environmental Sound Classification on the Edge: Deep Acoustic Networks for Extremely Resource-Constrained Devices
- GISE-51: A scalable isolated sound events dataset
- Guided Training: A Simple Method for Single-channel Speaker Separation
- SubSpectral Normalization for Neural Audio Data Processing