Pinned Repositories
ASR_2018_T01
Example repository for 2018 DS/NC 821 / Automatic Speech Recognition projects
attention
A list of references for end-to-end ASR
BMSCE_workshop
cs224s
CS224S / LINGUIST285 - Spoken Language Processing
ctc-segmentation
CTC segmentation python package
deep_asr
my learnings on implementing deep ASR models
espnet
My changes to ESPnet
KDataScience2017
ML1
This project contains assignments, project and solutions for the exam of the course "2017 CS/DS 706 Machine Learning"
Online-Speech-Recognition
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
sknadig's Repositories
sknadig/cs224s
CS224S / LINGUIST285 - Spoken Language Processing
sknadig/2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflection
sknadig/ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs. The output is a mix of in-vocabulary words and phoneme sequences. This decoding is suitable for systems with only a small dictionary available and for further recovery of OOV words.
sknadig/avatarify
Avatars for Zoom and Skype
sknadig/DLASR_lecture
sknadig/EmuServerPython
Implementation of an EMU-webApp websocket server in Python
sknadig/espnet_tts_frontend
Text frontend for ESPnet tts recipes
sknadig/fast_align
Simple, fast unsupervised word aligner
sknadig/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
sknadig/google-offline-speech-recognition
This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by replicating it on any system that supports tensorflow.
sknadig/grpc_playground
sknadig/hey_snips
sknadig/home-assistant
:house_with_garden: Open source home automation that puts local control and privacy first
sknadig/indic-trans
The project aims on adding a state-of-the-art transliteration module for cross transliterations among all Indian languages including English.
sknadig/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
sknadig/kaldi
This is the official location of the Kaldi project.
sknadig/LeFlow
Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks
sknadig/lingvo
Lingvo
sknadig/live-transcribe-speech-engine
Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with Google's Cloud Speech API that are used in Live Transcribe.
sknadig/matrix-creator-fpga
Reference HDL code for the MATRIX Creator's Spartan 6 FPGA
sknadig/Matrix-Voice-ESP32-MQTT-Audio-Streamer
The repo has implementing an esp32 standalone MQTT audio streamer for the Matrix Voice, a 8 mic array board with a ledring. See https://www.matrix.one/products/voice. The software can be used with Rhasspy or Snips (depricated due to snips takeover by Sonos)
sknadig/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
sknadig/odas
ODAS: Open embeddeD Audition System
sknadig/rhasspy
Rhasspy voice assistant for offline home automation
sknadig/RIMs
Code for "Recurrent Independent Mechanisms"
sknadig/snowboy
DNN based hotword and wake word detection toolkit
sknadig/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
sknadig/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
sknadig/tensorflow_frame_level
sknadig/Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/docs/VoicePrivacy_2020_Eval_Plan_v1_1.pdf