widdiot

Speech Research Engineer

gnani.ai, Bengaluru

Pinned Repositories

APLUS_track
0 1 00
arabic_pronounce
Pronounce Arabic words
Language:Python0 0 00
asr_labs
ASR labs
Language:Jupyter Notebook0 0 00
Bag-of-Visual-Words
This has the BoVW model to classify images of the same object together among: airplanes, bikes, cars, faces.
Language:Jupyter Notebook1 1 00
Best-README-Template
An awesome README template to jumpstart your projects!
0 0 00
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Language:Python0 0 00
Text-Binarization
Language:Python1 0 10

widdiot's Repositories

widdiot/Bag-of-Visual-Words
This has the BoVW model to classify images of the same object together among: airplanes, bikes, cars, faces.
Language:Jupyter Notebook1 1 00
widdiot/Text-Binarization
Language:Python1 0 10
widdiot/APLUS_track
0 1 00
widdiot/arabic_pronounce
Pronounce Arabic words
Language:Python0 0 00
widdiot/asr_labs
ASR labs
Language:Jupyter Notebook0 0 00
widdiot/Best-README-Template
An awesome README template to jumpstart your projects!
0 0 00
widdiot/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Language:Python0 0 00
widdiot/ctcdecode
PyTorch CTC Decoder bindings
Language:C++0 0
widdiot/da-lang-id
Domain Adaptation for Spoken Language ID
Language:Python0 0
widdiot/demo
Example code to remind myself, especially of the API
Language:Python0 0
widdiot/Digit-Recognition
A CNN LeNet model to classify images of digits as 0 - 9.
Language:Python0 0
widdiot/E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
Language:Python0 0
widdiot/EEND
End-to-End Neural Diarization
Language:Python0 0
widdiot/kaldi
This is the official location of the Kaldi project.
Language:Shell0 0
widdiot/kaldi-postproc
Language:Python1 0
widdiot/marytts-lexicon-de
German lexicon for MaryTTS
0 0
widdiot/neural_sp
End-to-end ASR/LM implementation with PyTorch
Language:Python0 0
widdiot/pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
Language:Python0 0
widdiot/pychain_example
Language:Python0 0
widdiot/pytorch-streamloader
Language:Python0 0
widdiot/speech-training-recorder
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
Language:Python0 0
widdiot/spoteno
Spoken text normalization for asr
Language:Python0 0
widdiot/TIPR-assignment-1
Language:Jupyter Notebook0 0
widdiot/TIPR_ASSIGNMENT_2
Language:Jupyter Notebook0 0
widdiot/triplet-entropy-loss
Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Systems
Language:Python0 0
widdiot/Tuplemax-Loss
Unofficial implementation of pairwise tuplemax loss, Eq. (2) of "Tuplemax Loss for Language Identification" (https://arxiv.org/pdf/1811.12290.pdf). Works only for batch_size = 1.
Language:Python1 1
widdiot/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Language:Forth0 0
widdiot/VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
Language:Python0 0
widdiot/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python0 0
widdiot/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Language:Python0 0
