Pinned Repositories
build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2Face
DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
FAE-CV
This repo contains the reicpe to assemble a corpus for Foreign Accented English using the crowdsourced corpus Common Voice which contains (optional) accent labels.
FishBoardMix
The FishBoardMix corpus is designed to explore Speaker-Age estimation technology.
kaldi
This is the official location of the Kaldi project.
mcr2
Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
speechbrain
A PyTorch-based Speech Toolkit
schaltung's Repositories
schaltung/FAE-CV
This repo contains the reicpe to assemble a corpus for Foreign Accented English using the crowdsourced corpus Common Voice which contains (optional) accent labels.
schaltung/FishBoardMix
The FishBoardMix corpus is designed to explore Speaker-Age estimation technology.
schaltung/build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2Face
schaltung/DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
schaltung/kaldi
This is the official location of the Kaldi project.
schaltung/mcr2
Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)
schaltung/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
schaltung/speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
schaltung/Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
schaltung/speechbrain
A PyTorch-based Speech Toolkit
schaltung/TasNet
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
schaltung/websocket-bridge
Websockets <-> Riva proxy service. Audiocodes compatible.
schaltung/whisper
schaltung/XXX-blockchain-starter-kit
Created for toolchain: https://console.ng.bluemix.net/devops/toolchains/b981acec-b692-43ec-b168-0e02169b75b2?env_id=ibm%3Ayp%3Aus-south