xuanjihe

Pinned Repositories

Additive-Margin-Softmax
This is the implementation of paper <Additive Margin Softmax for Face Verification>
Language:Jupyter Notebook0 1 00
AIR-ASVspoof
Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"
Language:Python0 1 00
cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
Language:Python1 1 00
netVLAD
netVLAD implementation in TensorFlow
Language:Python1 1 00
self-attentive-emb-tf
Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)
Language:Python2 1 00
Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Language:Jupyter Notebook1 1 01
speech-emotion-recognition
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
Language:Python388 13 43142
Speech_Emotion_Recognition_DNN-ELM
Implementation of Speech Emotion Recognition using DNN-ELM
Language:Python1 1 02
tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Language:Python1 1 00
wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
Language:C++1 1 00

xuanjihe's Repositories

xuanjihe/speech-emotion-recognition
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
Language:Python388 13 43142
xuanjihe/cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
Language:Python1 1 00
xuanjihe/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Language:Jupyter Notebook1 1 01
xuanjihe/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Language:Python1 1 00
xuanjihe/wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
Language:C++1 1 00
xuanjihe/AIR-ASVspoof
Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"
Language:Python0 1 00
xuanjihe/ASSERT
JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).
Language:MATLAB1 0
xuanjihe/Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
Language:Python1 0
xuanjihe/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1 0
xuanjihe/CircleLoss
Pytorch implementation of the paper "Circle Loss: A Unified Perspective of Pair Similarity Optimization"
Language:Python1 0
xuanjihe/Dcase2018_pooling
Repo for our pooling approach on the DCASE2018 task4
Language:Python1 0
xuanjihe/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Language:Python1 0
xuanjihe/ECAPA-TDNN
Language:Python1 0
xuanjihe/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Language:Python1 0
xuanjihe/GradientReversal
Gradient Reversal Layer for Domain Adaptation
Language:Python0 0
xuanjihe/kaldi
This is now the official location of the Kaldi project.
Language:Shell1 0
xuanjihe/MomentumContrast.pytorch
Reproduction of Momentum Contrast for Unsupervised Visual Representation Learning
Language:Python1 0
xuanjihe/prefetch_generator
Simple package that makes your generator work in background thread
Language:Jupyter Notebook1 0
xuanjihe/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Language:Python1 0
xuanjihe/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Language:Python1 0
xuanjihe/QAMFace
Pytorch implementation of Quadratic Additive Angular Margin Loss for Face Recognition
Language:Python1 0
xuanjihe/speaker_embedding_moco
Language:Python1 0
xuanjihe/Speaker_Verification
Tensorflow implementation of generalized end-to-end loss for speaker verification
Language:Python1 0
xuanjihe/spec_augment
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Language:Jupyter Notebook1 0
xuanjihe/SpectralCluster
Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
Language:Python1 0
xuanjihe/Speech_emotion_recognition_BLSTM
Bidirectional LSTM network for speech emotion recognition.
Language:Python1 0
xuanjihe/SphereFace
This is a MNIST Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.
Language:Python1 0
xuanjihe/tensorflow-triplet-loss
Implementation of triplet loss in TensorFlow
Language:Python1 0
xuanjihe/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language:Python1 0
xuanjihe/VBx
Variational Bayes HMM over x-vectors diarization
Language:Python1 0