SynthAether

Enthusiast of neural synthesizers and vocoders, always curious.

Pinned Repositories

AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Language: Python
2 1 00
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models, a dark horse in the field of Generative Models
Language: HTML
2 1 01
diffusion-audio-restoration-nvidia-SR
Audio-to-Audio Schrödinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
Language: Python
10
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language: Python
0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python
10
golf_diff_Glottal_Flow_LPC_synthesis
A DDSP-based neural vocoder.
Language: Jupyter Notebook
0 0 00
MB-iSTFT-VITS2_super-monotonic-align
Application of MB-iSTFT-VITS components to vits2_pytorch
Language: Python
1 0 00
tacospawn
PyTorch implementation of TacoSpawn, Speaker Generation
Language: Python
8 3 03
unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
Language: Jupyter Notebook
6 0 03
WaveletAttention
Wavelet-Attention CNNs for Image Classification
Language: Python
10 0 01

SynthAether's Repositories

SynthAether/eben
Repo for source code of EBEN: Extreme Bandwidth Extension Network
Language: Python
1 0 0
SynthAether/Large-Audio-Models
Keeps track of large models in the audio domain, including speech, singing, and music.
1 0 0
SynthAether/OpenVoice
Instant voice cloning by MyShell
Language: Python
1 0 0
SynthAether/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, a narrator, model finetuning, custom models, and wav file maintenance. It can also be used with third-party software via JSON calls.
Language: HTML
0 0
SynthAether/asmgen_SIMD
Generator for select AVX/AVX2/FMA/AVX512/NEON/SVE/RVV inline assembly instructions for use with C/C++
Language: Python
0 0
SynthAether/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language: Python
SynthAether/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Language: Python
0 0
SynthAether/build-nanogpt
Video+code lecture on building nanoGPT from scratch
Language: Python
0 0
SynthAether/DDSP-SVC
End-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Language: Python
SynthAether/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Language: Python
1 0
SynthAether/DEX-TTS
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
Language: Python
0 0
SynthAether/genmusic_demo_list
A list of demo websites for automatic music generation research
SynthAether/LLaMA-Factory
Unify Efficient Fine-tuning of 100+ LLMs
Language: Python
0 0
SynthAether/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language: Jupyter Notebook
0 0
SynthAether/llm.c
LLM training in simple, raw C/CUDA
Language: Cuda
SynthAether/NeMo
NeMo: a toolkit for conversational AI
Language: Python
0 0
SynthAether/normalizing-flows
PyTorch implementation of normalizing flow models
Language: Python
0 0
SynthAether/RWKV-LM
RWKV is an RNN with transformer-level performance. It can be directly trained like a GPT transformer (parallelizable). So it combines the best of RNN and transformer: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.
Language: Python
0 0
SynthAether/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language: Jupyter Notebook
0 0
SynthAether/seed-tts-eval
Language: Python
0 0
SynthAether/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
Language: Python
0 0
SynthAether/sgmse_Speech-Enhancement-and-Dereverberation-with-Diffusion-based-Generative-Models
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Language: Python
0 0
SynthAether/stable-audio-tools
Generative models for conditional audio generation
Language: Python
0 0
SynthAether/STFT
C++ STFT, ISTFT, and mel-filterbank modules
Language: Jupyter Notebook
0 0
SynthAether/StreamSpeech_SpeechTranslation
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language: Python
0 0
SynthAether/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language: Python
0 0
SynthAether/torbi_Viterbi_decoding_in_PyTorch
Viterbi decoding in PyTorch
Language: Python
0 0
SynthAether/torchlpc_LPC
LPC with PyTorch
Language: Python
0 0
SynthAether/utmos
A toolkit to calculate speech audio quality. Not affiliated with the original authors.
Language: Python
0 0
SynthAether/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
Language: Python
0 0