Pinned Repositories
APLUS_track
arabic_pronounce
Pronounce Arabic words
asr_labs
ASR labs
Bag-of-Visual-Words
This has he BoVW model to classify the images of same object together among: airplanes, bikes, cars, faces.
Best-README-Template
An awesome README template to jumpstart your projects!
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Text-Binarization
widdiot's Repositories
widdiot/Bag-of-Visual-Words
This has he BoVW model to classify the images of same object together among: airplanes, bikes, cars, faces.
widdiot/Text-Binarization
widdiot/APLUS_track
widdiot/arabic_pronounce
Pronounce Arabic words
widdiot/asr_labs
ASR labs
widdiot/Best-README-Template
An awesome README template to jumpstart your projects!
widdiot/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
widdiot/ctcdecode
PyTorch CTC Decoder bindings
widdiot/da-lang-id
Domain Adaptation for Spoken Language ID
widdiot/demo
example code for remind myself, especial the api
widdiot/Digit-Recognition
A CNN LeNet model to classify images of digits as 0 - 9.
widdiot/E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
widdiot/EEND
End-to-End Neural Diarization
widdiot/kaldi
This is the official location of the Kaldi project.
widdiot/kaldi-postproc
widdiot/marytts-lexicon-de
German lexicon for MaryTTS
widdiot/neural_sp
End-to-end ASR/LM implementation with PyTorch
widdiot/pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
widdiot/pychain_example
widdiot/pytorch-streamloader
widdiot/speech-training-recorder
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
widdiot/spoteno
Spoken text normalization for asr
widdiot/TIPR-assignment-1
widdiot/TIPR_ASSIGNMENT_2
widdiot/triplet-entropy-loss
Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Systems
widdiot/Tuplemax-Loss
Unofficial implementation of pairwise tuplemax loss. TUPLEMAX LOSS FOR LANGUAGE IDENTIFICATION https://arxiv.org/pdf/1811.12290.pdf Eq. (2). works only for batch_size = 1
widdiot/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
widdiot/VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
widdiot/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
widdiot/youtube-dl
Command-line program to download videos from YouTube.com and other video sites