craigbaker's Stars
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
CPJKU/madmom
Python audio and music signal processing library
bootphon/phonemizer
Simple text to phones converter for multiple languages
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising
AI-Guru/music-generation-research
A straightforward collection of Music Generation research resources.
CUNY-CL/wikipron
Massively multilingual pronunciation mining
Xmader/musescore-dataset
The dataset of all music sheets and users on musescore.com (unmaintained/discontinued since Sep 30, 2021)
sarulab-speech/jtubespeech
fosfrancesco/asap-dataset
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
emanuelhuber/RGPR
Ground-penetrating radar (GPR) data processing and visualisation: a free and open-source software package (R language)
KeSpeech/KeSpeech
This repo provides information about the KeSpeech dataset.
syang1993/FFTNet
A PyTorch implementation of FFTNet: A Real-Time Speaker-Dependent Neural Vocoder
mpariente/pytorch_stoi
STOI loss function in PyTorch
laboroai/LaboroTVSpeech
PaulleDemon/tkVideoPlayer
Video player for tkinter.
g0v/moedict-data-twblg
Data files for the Dictionary of Frequently-Used Taiwan Minnan (臺灣閩南語常用詞辭典)
CanCLID/ToJyutping
Cantonese Pronunciation Automatic Labeling Tool (粵語拼音自動標註工具)
besacier/AMMIcourse
dbklim/StressRNN
Modified version of RusStress (https://github.com/MashaPo/russtress), a Python package for placing stress in Russian text using an RNN (BiLSTM) and the "Grammatical Dictionary" by A. A. Zaliznyak (from http://odict.ru/).
qcri/ArabicASRChallenge2016
This repository
JasonSWFu/End-to-end-waveform-utterance-enhancement
End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)
aryamanarora/schwa-deletion
Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi
jimpala/torch-wavenet
PyTorch implementation of the DeepMind WaveNet paper.
shifaspv/SE-FFTNet-tensorflow-implemenatation
SE-FFTNet: a new feature extraction pattern for end-to-end speech enhancement
liesenf/MYCanCor
Malaysia Cantonese Corpus (MYCanCor) - A video corpus of natural Cantonese conversations
NWU-MuST/za_lex
Lexical pronunciation resources for TTS in South African languages
AsoSoft/Kurdish-G2P-dataset
Datasets for evaluation of Central Kurdish Grapheme-to-Phoneme Conversion systems
ftyers/fieldasr
tawantinsuyurunakunarayku/QillqaqHMMmodel
ASR for the Quechua language, built with HMMs