karkirowle

Pathological Speech Synthesis | @Nagoya University

karkirowle's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python79k 636 09.5k
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10.1k 134 52862
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.6k 134 1.1k1.5k
mamba-org/mamba
The Fast Cross-Platform Package Manager
Language:C++7.2k 45 1.9k386
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.4k 51 224430
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Language:C1.9k 103 159304
dexplo/bar_chart_race
Create animated bar chart races in Python with matplotlib
Language:Python1.4k 25 74363
kaegi/alass
"Automatic Language-Agnostic Subtitle Synchronization"
Language:Rust1.1k 28 5058
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language:Python859 17 142130
NVIDIA/radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.
Language:Roff285 14 3040
jhpoelen/zenodo-upload
upload big files to Zenodo using cURL, jq and bash
Language:Shell258 4 1239
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
Language:Python249 15 3456
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
Language:HTML214 19 2658
k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Language:Python171 9 2831
talhanai/speech-nlp-datasets
Contains links to publicly available datasets for modeling health outcomes using speech and language.
118 5 122
brentspell/torch-yin
Yin pitch estimator in PyTorch
Language:Python114 6 17
idiap/acoustic-simulator
Implementation of audio degradation processes
Language:Python101 15 236
tarepan/VoiceConversionLab
Collect Voice Conversion researches
Language:TypeScript92 1 6477
KunZhou9646/seq2seq-EVC
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage sequence-to-sequence training.
Language:Python84 1 1416
LeoniusChen/Attentions-in-Tacotron
Language:Python69 3 113
articulatory/articulatory
Deep Articulatory Synthesis and Inversion
Language:Python47 5 24
r9y9/jsut-lab
HTS-style full-context labels for JSUT v1.1
46 4 12
nils-werner/pymushra
pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.
Language:Python43 4 56
KunZhou9646/controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
Language:Python42 3 514
maclandrol/FisherExact
Fisher exact test for mxn contingency table in python
Language:Fortran38 4 1512
MingjieChen/LowResourceVC
Voice conversion training with 109 speakers with limited training samples
Language:Python35 4 45
6gsn/marine
Language:Python33 4 52
jdvala/zoom_audio_transcribe
Zoom Audio Transcription offline
Language:Python32 4 56
soumimaiti/speechlmscore_tool
Language:Python29 4 12
stoneMo/ASVspoof
Language:Python20 4 12

karkirowle

karkirowle's Stars

openai/whisper

AIGC-Audio/AudioGPT

speechbrain/speechbrain

mamba-org/mamba

asteroid-team/asteroid

julius-speech/julius

dexplo/bar_chart_race

kaegi/alass

wenet-e2e/wespeaker

NVIDIA/radtts

jhpoelen/zenodo-upload

jxzhanggg/nonparaSeq2seqVC_code

microsoft/P.808

k2kobayashi/crank

talhanai/speech-nlp-datasets

brentspell/torch-yin

idiap/acoustic-simulator

tarepan/VoiceConversionLab

KunZhou9646/seq2seq-EVC

LeoniusChen/Attentions-in-Tacotron

articulatory/articulatory

r9y9/jsut-lab

nils-werner/pymushra

KunZhou9646/controllable_evc_code

maclandrol/FisherExact

MingjieChen/LowResourceVC

6gsn/marine

jdvala/zoom_audio_transcribe

soumimaiti/speechlmscore_tool

stoneMo/ASVspoof