Pinned Repositories
Additive-Margin-Softmax
Implementation of the paper "Additive Margin Softmax for Face Verification"
AIR-ASVspoof
Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"
cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
netVLAD
netVLAD implementation in TensorFlow
self-attentive-emb-tf
Simple TensorFlow implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)
Speech-Emotion-Analyzer
A neural network model capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)
speech-emotion-recognition
Speech emotion recognition using convolutional recurrent networks, based on IEMOCAP
Speech_Emotion_Recognition_DNN-ELM
Implementation of Speech Emotion Recognition using DNN-ELM
tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and TensorFlow
wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
xuanjihe's Repositories
xuanjihe/speech-emotion-recognition
Speech emotion recognition using convolutional recurrent networks, based on IEMOCAP
xuanjihe/cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
xuanjihe/Speech-Emotion-Analyzer
A neural network model capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)
xuanjihe/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and TensorFlow
xuanjihe/wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
xuanjihe/AIR-ASVspoof
Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"
xuanjihe/ASSERT
JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).
xuanjihe/Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
xuanjihe/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
xuanjihe/CircleLoss
PyTorch implementation of the paper "Circle Loss: A Unified Perspective of Pair Similarity Optimization"
xuanjihe/Dcase2018_pooling
Repo for our pooling approach to DCASE 2018 Task 4
xuanjihe/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in TensorFlow
xuanjihe/ECAPA-TDNN
xuanjihe/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
xuanjihe/GradientReversal
Gradient Reversal Layer for Domain Adaptation
xuanjihe/kaldi
This is now the official location of the Kaldi project.
xuanjihe/MomentumContrast.pytorch
Reproduction of Momentum Contrast for Unsupervised Visual Representation Learning
xuanjihe/prefetch_generator
Simple package that makes your generator work in a background thread
xuanjihe/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
xuanjihe/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
xuanjihe/QAMFace
PyTorch implementation of Quadratic Additive Angular Margin Loss for Face Recognition
xuanjihe/speaker_embedding_moco
xuanjihe/Speaker_Verification
TensorFlow implementation of the generalized end-to-end loss for speaker verification
xuanjihe/spec_augment
🔦 A PyTorch implementation of Google Brain's "SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition"
xuanjihe/SpectralCluster
Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
xuanjihe/Speech_emotion_recognition_BLSTM
Bidirectional LSTM network for speech emotion recognition.
xuanjihe/SphereFace
An MNIST implementation of "SphereFace: Deep Hypersphere Embedding for Face Recognition" (CVPR'17).
xuanjihe/tensorflow-triplet-loss
Implementation of triplet loss in TensorFlow
xuanjihe/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper "Fully Supervised Speaker Diarization".
xuanjihe/VBx
Variational Bayes HMM over x-vectors diarization