SynthAether

Enthusiast of neural synthesizers and vocoders, always curious.

Pinned Repositories

AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Language:Python2 1 00
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models, a darkhorse in the field of Generative Models
Language:HTML2 1 01
diffusion-audio-restoration-nvidia-SR
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
Language:Python10
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python10
golf_diff_Glottal_Flow_LPC_synthesis
A DDSP-based neural vocoder.
Language:Jupyter Notebook0 0 00
MB-iSTFT-VITS2_super-monotonic-align
Application of MB-iSTFT-VITS components to vits2_pytorch
Language:Python1 0 00
tacospawn
PyTorch implementation of TacoSpawn, Speaker Generation
Language:Python8 3 03
unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
Language:Jupyter Notebook6 0 03
WaveletAttention
Wavelet-Attention CNNs for Image Classification
Language:Python10 0 01

SynthAether's Repositories

SynthAether/emotion-annotations
Language:Python1
SynthAether/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python10
SynthAether/spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
Language:Jupyter Notebook1 0 0
SynthAether/swift-f0_pitch
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
Language:Python1
SynthAether/Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
SynthAether/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python0 0 00
SynthAether/aimet_quant
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Language:Python
SynthAether/ARC-Encoder
SynthAether/BlaGPT
Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.
Language:Python
SynthAether/CFrame_ZX
Simple C framework for the ZX Spectrum Next
Language:Assembly
SynthAether/Chatterbox-TTS-Extended
Modified version of Chatterbox that accepts text files as input and no character restrictions
Language:Python
SynthAether/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
Language:Python
SynthAether/CosyVoice_TTS
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python
SynthAether/DiaMoE-TTS
Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"
SynthAether/FireRedTTS2
Long-form streaming TTS system for multi-speaker dialogue generation
Language:Python
SynthAether/fish-speech
Brand new TTS solution
Language:Python0 0
SynthAether/flash-attention
Language:Python0 0
SynthAether/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Language:Python
SynthAether/mair-hub
Language:Jupyter Notebook
SynthAether/nanochat
The best ChatGPT that $100 can buy.
SynthAether/NeMo
NeMo: a toolkit for conversational AI
Language:Python1 0
SynthAether/NeMoTTS
Language:Python
SynthAether/next-3D
A 3D library for the ZX Spectrum Next
Language:Assembly
SynthAether/ParaStyleTTS
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation
SynthAether/RWKV_TTS
This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).
Language:Python
SynthAether/SAC_semantic
Trainging, inference, and testing of the SAC speech codec model.
SynthAether/stylish-tts
High quality text-to-speech based on StyleTTS 2.
Language:Python
SynthAether/TTS-WebUI
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
Language:TypeScript
SynthAether/UTMOSv2
Language:Python0 0
SynthAether/yt-dlp
A youtube-dl fork with additional features and fixes
Language:Python0 0