non-autoregressive

There are 34 repositories under the non-autoregressive topic.

lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Language: Python | 1.5k 50 2492
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language: Python | 1.1k 14 1786
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language: Jupyter Notebook | 1.1k 16 96156
keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Language: Python | 341 21 2938
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python | 338 10 2745
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Language: Python | 326 12 2042
keonlee9420/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Language: Python | 306 4 2046
keonlee9420/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Language: Python | 243 4 830
keonlee9420/DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Language: Python | 239 8 313
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Language: Python | 195 6 1723
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language: Python | 194 6 1727
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language: Python | 190 14 1944
HKUNLP/diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
Language: Python | 178 6 414
xcfcode/What-I-Have-Read
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
165 6 015
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Language: Python | 146 10 519
HKUNLP/reparam-discrete-diffusion
Reparameterized Discrete Diffusion Models for Text Generation
Language: Python | 101 2 43
henry-yeh/GLOP
[AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time
Language: Python | 93 2 514
ictnlp/NAST-S2x
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Language: Python | 75 4 34
keonlee9420/VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Language: Python | 73 4 314
keonlee9420/FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Language: Python | 72 2 314
keonlee9420/WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Language: Python | 69 6 518
bearcatt/LaBERT
A length-controllable and non-autoregressive image captioning model.
Language: Python | 68 5 1712
keonlee9420/Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Language: Python | 57 3 713
jxzhangjhu/awesome-LLM-controlled-decoding-generation
awesome-LLM-controlled-constrained-generation
50 1 03
hemingkx/SpecDec
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
Language: Python | 44 2 31
HKUNLP/DiffuSearch
[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"
Language: Python | 33 5 21
yzhangcs/ctc-copy
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
Language: Python | 20 1 24
keonlee9420/Deep-Learning-TTS-Template
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
Language: Python | 15 1 0
kan-bayashi/NonARSeq2SeqVC
Non-autoregressive sequence-to-sequence voice conversion
6 2 00
RistoAle97/ContinualNAT
M.Sc. thesis on Continual Learning for Non-Autoregressive Neural Machine Translation
Language: Python | 6 2 150
ducnt18121997/Viet-Transformer-TTS
A PyTorch implementation of a non-autoregressive Transformer with unsupervised duration learning, based on Transformer and Conformer blocks, with support for Vietnamese.
Language: Python | 5 1 00
aistairc/BERT-NAR-BERT
BERT-based pre-trained non-autoregressive sequence-to-sequence model
Language: Python | 3 5 21
LARC-CMU-SMU/Enconter
Implementation of 2021 EACL paper Enconter
Language: Jupyter Notebook | 2 4 00
mahshid1378/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language: Python | 1
