Patchethium's Stars
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
ggerganov/ggml
Tensor library for machine learning
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
bitshifter/glam-rs
A simple and fast linear algebra library for games and graphics
photosynthesis-team/piq
Measures and metrics for image2image tasks. PyTorch.
kobaltedev/kobalte
A UI toolkit for building accessible web apps and design systems with SolidJS.
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
pykeio/ort
Fast ML inference & training for Rust with ONNX Runtime
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
ikawaha/kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Kyubyong/g2p
g2p: English Grapheme To Phoneme Conversion
avaneev/r8brain-free-src
High-quality pro audio resampler / sample rate conversion C++ library. Very fast, for both audio resampling and time-series interpolation.
ddlBoJack/Speech-Resources
语音方向实验室/公司/资源/实习等,欢迎推荐或自荐
Emotional-Text-to-Speech/dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
kakaobrain/g2pm
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
asuni/wavelet_prosody_toolkit
espnet/espnet_onnx
Onnx wrapper for espnet infrernce model
revsic/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
yl4579/AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
Emotional-Text-to-Speech/hmm-for-emo-tts
:computer: A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech :speaker: from text
prosodylab/prosodylab.dictionaries
A repository for dictionaries to be used with the Prosodylab-Aligner
raa0121/translate-discord-bot
Discord translate bot
VOICEVOX/open_jtalk-rs
daac-tools/rucrf
Conditional Random Fields implemented in pure Rust
OYCN/OnnxEditorV2
A Qt based Visual Editor for ONNX