TitiAffandi

Pinned Repositories

asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Language:Python0 1 00
conformer
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
Language:Python0 0 00
conformerLucidrains
Implementation of the convolutional module from the Conformer paper, for use in Transformers
Language:Python0 0 00
conformerModel
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python0 0 00
id-nlp-resource
A list of Indonesian NLP resources.
0 1 00
ivector-xvector
Extract xvector and ivector under kaldi
Language:Shell0 1 00
maps_reproducible
Reproducible Research documentation for MaPS-f0
Language:MATLAB00
mcd
Mel cepstral distortion (MCD) computations in python.
Language:Python0 1 00
mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Language:Jupyter Notebook0 1 00
MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
Language:Python0 0 00

TitiAffandi's Repositories

TitiAffandi/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Language:Python0 1 00
TitiAffandi/conformer
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
Language:Python0 0 00
TitiAffandi/conformerLucidrains
Implementation of the convolutional module from the Conformer paper, for use in Transformers
Language:Python0 0 00
TitiAffandi/conformerModel
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python0 0 00
TitiAffandi/id-nlp-resource
A list of Indonesian NLP resources.
0 1 00
TitiAffandi/ivector-xvector
Extract xvector and ivector under kaldi
Language:Shell0 1 00
TitiAffandi/maps_reproducible
Reproducible Research documentation for MaPS-f0
Language:MATLAB00
TitiAffandi/mcd
Mel cepstral distortion (MCD) computations in python.
Language:Python0 1 00
TitiAffandi/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Language:Jupyter Notebook0 1 00
TitiAffandi/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
Language:Python0 0 00
TitiAffandi/multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
TitiAffandi/multi-speaker-tacotron-tensorflow
Multi-speaker Tacotron in TensorFlow.
Language:Python1 0
TitiAffandi/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
TitiAffandi/probing-TTS-models
Link to paper: https://arxiv.org/abs/1912.10915
Language:Jupyter Notebook1 0
TitiAffandi/pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Language:Perl1 0
TitiAffandi/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
TitiAffandi/sas-python-work
Language:Jupyter Notebook1 0
TitiAffandi/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook1 0
TitiAffandi/tacotron2-ZS
Language:Jupyter Notebook
TitiAffandi/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language:Python1 0
TitiAffandi/waveglow
A Flow-based Generative Network for Speech Synthesis
TitiAffandi/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python0 0
TitiAffandi/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
Language:Python0 0