SynthAether

Enthusiast of neural synthesizers and vocoders, always curious.

Pinned Repositories

AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Language:Python2 1 00
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models, a darkhorse in the field of Generative Models
Language:HTML2 1 01
diffusion-audio-restoration-nvidia-SR
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
Language:Python10
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python10
golf_diff_Glottal_Flow_LPC_synthesis
A DDSP-based neural vocoder.
Language:Jupyter Notebook0 0 00
MB-iSTFT-VITS2_super-monotonic-align
Application of MB-iSTFT-VITS components to vits2_pytorch
Language:Python1 0 00
tacospawn
PyTorch implementation of TacoSpawn, Speaker Generation
Language:Python8 3 03
unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
Language:Jupyter Notebook6 0 03
WaveletAttention
Wavelet-Attention CNNs for Image Classification
Language:Python10 0 01

SynthAether's Repositories

SynthAether/auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python1 0 0
SynthAether/RAVE
Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder
Language:Python1 1 0
SynthAether/speechbrain_Conversational-AI
A PyTorch-based Speech Toolkit
Language:Python1 0 0
SynthAether/whisperX
WhisperX: Timestamp-Accurate Automatic Speech Recognition.
Language:Python1 0 01
SynthAether/BABE2_music_restoration_enhancement
Language:Python0 0
SynthAether/bark_TTS
🔊 Text-prompted Generative Audio Model
Language:Jupyter Notebook0 0
SynthAether/CML-TTS-Dataset
CML-TTS: A Multilingual Dataset for Speech Synthesis
Language:HTML0 0
SynthAether/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps"
Language:Python0 0
SynthAether/fairseq2
FAIR Sequence Modeling Toolkit 2
Language:Python0 0
SynthAether/FastBERT
The repository for the code of the FastBERT paper
Language:Python
SynthAether/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Language:C++0 0
SynthAether/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language:Python0 0
SynthAether/Grad-TTS
Implementation of the 'Grad-TTS' with Multilingual Cleaners
Language:Jupyter Notebook0 0
SynthAether/LAVISH
Vision Transformers are Parameter-Efficient Audio-Visual Learners
Language:Python0 0
SynthAether/llama.cpp
Port of Facebook's LLaMA model in C/C++
Language:C++0 0
SynthAether/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python
SynthAether/nansypp
Unofficial implementation of NANSY++ in Pytorch Lightning
Language:Python0 0
SynthAether/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
Language:Jupyter Notebook0 0
SynthAether/penn_Pitch-Estimating-Neural-Networks-
Pitch Estimating Neural Networks (PENN)
Language:Python0 0
SynthAether/PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
Language:Python0 0
SynthAether/podcast-summarizer
OpenAI Whisper + davinci for podcast summarization
Language:Jupyter Notebook0 0
SynthAether/ppgs_High-Fidelity-Neural-Phonetic-Posteriorgrams
High-Fidelity Neural Phonetic Posteriorgrams
Language:Python0 0
SynthAether/praat
Praat: Doing Phonetics By Computer
Language:C
SynthAether/pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Language:Python2 0
SynthAether/tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
Language:Python
SynthAether/tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
Language:Jupyter Notebook
SynthAether/vitsgpt-vits
the code for vits in the vitsGPT project
Language:Jupyter Notebook0 0
SynthAether/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook0 0
SynthAether/VoRAS_VC
VoRAS: Vocos Retrieval and self-Augmentation for Speech
Language:Python0 0
SynthAether/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Language:Python0 0