Pinned Repositories
ai_intern
android-wav
Used to record audio in wav format in android
ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs. The output is a mix of in-vocabulary words and phoneme sequences. This decoding is suitable for systems with only a small dictionary available and for further recovery of OOV words.
asr_wake_demo
aubio
a library for audio and music analysis
AudioRecorder-IOS
ios_offline_asr
Offline ASR Library for IOS
KaldiWebrtcServer
Python server for communicating with Kaldi from the browser using WebRTC
Location-Recommendation
ws_bridge_asterisk
Durgesh92's Repositories
Durgesh92/ws_bridge_asterisk
Durgesh92/ai_intern
Durgesh92/android-wav
Used to record audio in wav format in android
Durgesh92/asr_wake_demo
Durgesh92/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
Durgesh92/build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2Face
Durgesh92/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Durgesh92/Durgesh92
Config files for my GitHub profile.
Durgesh92/EfficientWord-Net
OneShot Learning-based hotword detection.
Durgesh92/FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
Durgesh92/flutter_kaldi_asr_plugin
Flutter plugin wrapping Kaldi libraries
Durgesh92/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem
Goodness of Pronunciation Pipelines for OOV Removal
Durgesh92/gop-pykaldi
Goodness of Pronunciation algorithm using PyKaldi
Durgesh92/gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
Durgesh92/kaldi_pa_fix
Durgesh92/lazykh
Source code for the automatic lip-syncing project described in this video! https://www.youtube.com/watch?v=y3B8YqeLCpY
Durgesh92/mbert_hi_IN_finetuning
Durgesh92/normalise
A module for normalising text.
Durgesh92/nvidia_conformer_training
Durgesh92/ondevice_graph_android_demo
Durgesh92/python_asr_client
Durgesh92/smart-dog-collor
Durgesh92/snowman
Snowboy reimplementation
Durgesh92/soc_mfcc_cnn
A Soc for KWS
Durgesh92/sre
Durgesh92/TensorFlowLiteEmotionDemo
在Android上运行人脸表情识别的tflite模型
Durgesh92/tflite-kws
Keyword Spotting (KWS) API wrapper for TFLite streaming models.
Durgesh92/ULCA-asr-dataset-corpus
Durgesh92/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Durgesh92/whatsapp_webhook_gpt