SynthAether

Enthusiast of neural synthesizers and vocoders, always curious.

Pinned Repositories

AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Language:Python2 1 00
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models, a darkhorse in the field of Generative Models
Language:HTML2 1 01
diffusion-audio-restoration-nvidia-SR
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
Language:Python10
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python10
golf_diff_Glottal_Flow_LPC_synthesis
A DDSP-based neural vocoder.
Language:Jupyter Notebook0 0 00
MB-iSTFT-VITS2_super-monotonic-align
Application of MB-iSTFT-VITS components to vits2_pytorch
Language:Python1 0 00
tacospawn
PyTorch implementation of TacoSpawn, Speaker Generation
Language:Python8 3 03
unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
Language:Jupyter Notebook6 0 03
WaveletAttention
Wavelet-Attention CNNs for Image Classification
Language:Python10 0 01

SynthAether's Repositories

SynthAether/snac_Multi-Scale-Neural-Audio-Codec
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Language:Python1 0 0
SynthAether/Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
SynthAether/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python0 0 00
SynthAether/AAREfficient-Autoregressive-Audio-Modeling-via-Next-Scale-Prediction
[Official Implementation] Acoustic Autoregressive Modeling 🔥
Language:Python0 0
SynthAether/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0
SynthAether/f5-tts-mlx
Implementation of F5-TTS in MLX
Language:Python0 0
SynthAether/fish-speech
Brand new TTS solution
Language:Python0 0
SynthAether/GeneFace
Official Pytorch Implementation of GeneFace (ICLR 2023)
Language:Python0 0
SynthAether/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python0 0
SynthAether/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python0 0
SynthAether/hertz-dev
first base model for full-duplex conversational audio
Language:Python
SynthAether/highway_SIMD
Performance-portable, length-agnostic SIMD with runtime dispatch
Language:C++
SynthAether/lpc_vocoder
Vocoder LPC for speech signals
Language:Python0 0
SynthAether/MagVITS
VITS with phoneme-level prosody modeling based on MaskGIT
Language:Python0 0
SynthAether/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python0 0
SynthAether/moshi
Language:Python0 0
SynthAether/OuteTTS
Language:Python0 0
SynthAether/piper_larynx2_vits_TTS_cpp_onnx
A fast, local neural text to speech system
Language:C++
SynthAether/PyTorch-Wavelet-Toolbox
Differentiable fast wavelet transforms in PyTorch with GPU support.
Language:Python0 0
SynthAether/rfwave_vocoder
Language:Python0 0
SynthAether/rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
Language:Python0 0
SynthAether/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity and Number Detector
Language:Python0 0
SynthAether/simple-tts
（WIP）
Language:Python0 0
SynthAether/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
Language:Python0 0
SynthAether/sttatts
Language:Python0 0
SynthAether/super-monotonic-align_MAS
Language:Python0 0
SynthAether/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook0 0
SynthAether/ultravox
Language:Python
SynthAether/wavefit-pytorch
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
Language:Python0 0
SynthAether/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language:Python0 0