yahcong

yahcong's Stars

rabitt/ismir2017-deepsalience
Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"
Language:Jupyter Notebook8419
philipperemy/speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
Language:Python6220
BornInWater/Overlap-Detection
Overlapped Speech detection in Multi-party Conversations
Language:Python185
shvmshukla/Speaker-Change-Detection
Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
Language:Jupyter Notebook112
yinruiqing/change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
6315
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.3k5.3k
Jamiroquai88/VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
Language:Python9529
Janghyun1230/Speaker_Verification
Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"
Language:Python361103
Suhee05/Text-Independent-Speaker-Verification
Text Independent Speaker Verification Using GE2E Loss
Language:Python8350
HaiFengZeng/GE2E
Language:Python34
funcwj/ge2e-speaker-verification
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
Language:Python10125
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
Language:Python782274
wangleiai/dVectorSpeakerRecognition
基于dVector的说话人识别keras
Language:Python8734
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.5k2.2k
google/end-to-end
End-To-End is a crypto library to encrypt, decrypt, digital sign, and verify signed messages (implementing OpenPGP)
Language:JavaScript4.1k298
keras-team/keras
Deep Learning for humans
Language:Python62.1k19.5k
AKBoles/Deep-Learning-Speech-Recognition
Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.
Language:Jupyter Notebook4726
pyannote/pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
Language:Python19134
crystal-method/Looking-to-Listen
Language:Python409
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language:Python1.6k319
hbredin/TristouNet
TristouNet: Triplet Loss for Speaker Turn Embedding
Language:Python12535
juanjobosch/SourceFilterContoursMelody
Melody extraction based on source-filter modelling
Language:Python269
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language:C++25.4k4k
yinruiqing/diarization_with_neural_approach
Language:Python143
Felix-Yan/FastICA
A python version of fast and robust ICA based on the paper of Aapo Hyvärinen.
Language:Python297
ZhihaoDU/speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
Language:Python12241
justinsalamon/melosynth
Synthesize a continuous pitch sequence
Language:Python3611
justinsalamon/scaper
A library for soundscape synthesis and augmentation
Language:Python38256
marl/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Language:Python1.1k160
ankitshah009/Task-4-Large-scale-weakly-supervised-sound-event-detection-for-smart-cars
Task 4 Large-scale weakly supervised sound event detection for smart cars
Language:Python6531

yahcong

yahcong's Stars

rabitt/ismir2017-deepsalience

philipperemy/speaker-change-detection

BornInWater/Overlap-Detection

shvmshukla/Speaker-Change-Detection

yinruiqing/change_detection

kaldi-asr/kaldi

Jamiroquai88/VBDiarization

Janghyun1230/Speaker_Verification

Suhee05/Text-Independent-Speaker-Verification

HaiFengZeng/GE2E

funcwj/ge2e-speaker-verification

astorfi/3D-convolutional-speaker-recognition

wangleiai/dVectorSpeakerRecognition

espnet/espnet

google/end-to-end

keras-team/keras

AKBoles/Deep-Learning-Speech-Recognition

pyannote/pyannote-metrics

crystal-method/Looking-to-Listen

google/uis-rnn

hbredin/TristouNet

juanjobosch/SourceFilterContoursMelody

mozilla/DeepSpeech

yinruiqing/diarization_with_neural_approach

Felix-Yan/FastICA

ZhihaoDU/speech_feature_extractor

justinsalamon/melosynth

justinsalamon/scaper

marl/crepe

ankitshah009/Task-4-Large-scale-weakly-supervised-sound-event-detection-for-smart-cars