Durgesh92

I AM +Mumbai

Pinned Repositories

ai_intern
0 1 00
android-wav
Used to record audio in wav format in android
Language:Java0 0 00
ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs. The output is a mix of in-vocabulary words and phoneme sequences. This decoding is suitable for systems with only a small dictionary available and for further recovery of OOV words.
Language:Shell0 2 00
asr_wake_demo
0 2 00
aubio
a library for audio and music analysis
Language:C0 1 00
AudioRecorder-IOS
Language:Swift0 2 00
ios_offline_asr
Offline ASR Library for IOS
Language:Swift1 2 01
KaldiWebrtcServer
Python server for communicating with Kaldi from the browser using WebRTC
Language:Python1 1 00
Location-Recommendation
Language:Java1 2 00
ws_bridge_asterisk
Language:JavaScript2 2 01

Durgesh92's Repositories

Durgesh92/ws_bridge_asterisk
Language:JavaScript2 2 01
Durgesh92/ai_intern
0 1 00
Durgesh92/android-wav
Used to record audio in wav format in android
Language:Java0 0 00
Durgesh92/asr_wake_demo
0 2 00
Durgesh92/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
0 0 00
Durgesh92/build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2Face
Language:Jupyter Notebook0 0 00
Durgesh92/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Language:Python0 0
Durgesh92/Durgesh92
Config files for my GitHub profile.
1 0
Durgesh92/EfficientWord-Net
OneShot Learning-based hotword detection.
Language:Jupyter Notebook1 0
Durgesh92/FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
Language:Jupyter Notebook0 0
Durgesh92/flutter_kaldi_asr_plugin
Flutter plugin wrapping Kaldi libraries
Language:C++1 0
Durgesh92/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem
Goodness of Pronunciation Pipelines for OOV Removal
Language:Perl0 0
Durgesh92/gop-pykaldi
Goodness of Pronunciation algorithm using PyKaldi
Language:Python1 0
Durgesh92/gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
Language:Python0 0
Durgesh92/kaldi_pa_fix
Language:Shell2 0
Durgesh92/lazykh
Source code for the automatic lip-syncing project described in this video! https://www.youtube.com/watch?v=y3B8YqeLCpY
Language:Python1 0
Durgesh92/mbert_hi_IN_finetuning
1 0
Durgesh92/normalise
A module for normalising text.
Language:Python1 0
Durgesh92/nvidia_conformer_training
1 0
Durgesh92/ondevice_graph_android_demo
Language:Shell1 0
Durgesh92/python_asr_client
Language:Python1 0
Durgesh92/smart-dog-collor
Language:C++2 1
Durgesh92/snowman
Snowboy reimplementation
Language:C++1 0
Durgesh92/soc_mfcc_cnn
A Soc for KWS
Language:Jupyter Notebook1 0
Durgesh92/sre
Language:Shell2 0
Durgesh92/TensorFlowLiteEmotionDemo
在Android上运行人脸表情识别的tflite模型
Language:Java0 0
Durgesh92/tflite-kws
Keyword Spotting (KWS) API wrapper for TFLite streaming models.
Language:Python1 0
Durgesh92/ULCA-asr-dataset-corpus
1 0
Durgesh92/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:C++1 0
Durgesh92/whatsapp_webhook_gpt
Language:Python1 0