Pinned Repositories
BiLatticeRNN-Confidence
Enriching LatticeRNN with sub-word level features for confidence score prediction
blabbertabber
Android Application to perform Speaker Diarization
btp
All code for my final year B.Tech Project on speaker diarization and speaker verification
change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
convo_segment
A script for unsupervised labeling of changes of speaker in a (polite) conversation or interview. Works by clustering speech features (means of MFCCs over 1 second time windows) . Uses the python_speech_features module.
ctci
Cracking the Coding Interview, 5th Edition
CtCI-6th-Edition
Cracking the Coding Interview 6th Ed. Solutions
Deep-Learning-Speech-Recognition
Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
1215thebqtic's Repositories
1215thebqtic/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
1215thebqtic/BiLatticeRNN-Confidence
Enriching LatticeRNN with sub-word level features for confidence score prediction
1215thebqtic/blabbertabber
Android Application to perform Speaker Diarization
1215thebqtic/change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
1215thebqtic/ctci
Cracking the Coding Interview, 5th Edition
1215thebqtic/CtCI-6th-Edition
Cracking the Coding Interview 6th Ed. Solutions
1215thebqtic/Deep-Learning-Speech-Recognition
Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.
1215thebqtic/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
1215thebqtic/Diarization_BIC
Speaker Diarization using agglomerative clustering
1215thebqtic/DiarTk
A fork of Idiap Research Institute's DiarTk diarization toolkit
1215thebqtic/DiViMe
Diarization Tools from JSALT 2017
1215thebqtic/DNC
Discriminative Neural Clustering for Speaker Diarisation
1215thebqtic/Hello-World
1215thebqtic/IBDiarization
C++ Implementation of the Information Bottleneck System
1215thebqtic/kaldi-websocket-python
Simple websocket server
1215thebqtic/Leetcode
Solution in Java
1215thebqtic/pyannote-audio
Speaker diarization
1215thebqtic/pyannote-db-librispeech
LibriSpeech plugin for pyannote.database
1215thebqtic/pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
1215thebqtic/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
1215thebqtic/returnn
The RWTH extensible training framework for universal recurrent neural networks
1215thebqtic/returnn-experiments
experiments with RETURNN
1215thebqtic/rnnoise-java
1215thebqtic/speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
1215thebqtic/speaker-diarization
Speaker diarization scripts, based on AaltoASR
1215thebqtic/tagging
search Web documents, more precisely, bookmarks based on tags
1215thebqtic/THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
1215thebqtic/training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
1215thebqtic/VBDiarization
Speaker diarization based on python implementation from http://voicebiometry.org/
1215thebqtic/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit