non-autoregressive

There are 34 repositories under the non-autoregressive topic.

  • lucidrains/soundstorm-pytorch

    Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

    Language: Python
  • ictnlp/StreamSpeech

    StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

    Language: Python
  • shivammehta25/Matcha-TTS

    [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

    Language: Jupyter Notebook
  • keonlee9420/PortaSpeech

    PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

    Language: Python
  • keonlee9420/DiffGAN-TTS

    PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

    Language: Python
  • keonlee9420/Comprehensive-Transformer-TTS

    A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

    Language: Python
  • keonlee9420/Expressive-FastSpeech2

    PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

    Language: Python
  • keonlee9420/DiffSinger

    PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

    Language: Python
  • keonlee9420/DailyTalk

    Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

    Language: Python
  • keonlee9420/StyleSpeech

    PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

    Language: Python
  • keonlee9420/Cross-Speaker-Emotion-Transfer

    PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

    Language: Python
  • keonlee9420/Parallel-Tacotron2

    PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

    Language: Python
  • HKUNLP/diffusion-of-thoughts

    [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

    Language: Python
  • xcfcode/What-I-Have-Read

    Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers

  • keonlee9420/Comprehensive-E2E-TTS

    A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

    Language: Python
  • HKUNLP/reparam-discrete-diffusion

    Reparameterized Discrete Diffusion Models for Text Generation

    Language: Python
  • henry-yeh/GLOP

    [AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time

    Language: Python
  • ictnlp/NAST-S2x

    A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.

    Language: Python
  • keonlee9420/VAENAR-TTS

    PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

    Language: Python
  • keonlee9420/FastPitchFormant

    PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

    Language: Python
  • keonlee9420/WaveGrad2

    PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

    Language: Python
  • bearcatt/LaBERT

    A length-controllable and non-autoregressive image captioning model.

    Language: Python
  • keonlee9420/Daft-Exprt

    PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

    Language: Python
  • jxzhangjhu/awesome-LLM-controlled-decoding-generation

    awesome-LLM-controlled-constrained-generation

  • hemingkx/SpecDec

    Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)

    Language: Python
  • HKUNLP/DiffuSearch

    [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"

    Language: Python
  • yzhangcs/ctc-copy

    [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".

    Language: Python
  • keonlee9420/Deep-Learning-TTS-Template

    This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

    Language: Python
  • kan-bayashi/NonARSeq2SeqVC

    Non-autoregressive sequence-to-sequence voice conversion

  • RistoAle97/ContinualNAT

    M.Sc. thesis on Continual Learning for Non-Autoregressive Neural Machine Translation

    Language: Python
  • ducnt18121997/Viet-Transformer-TTS

    PyTorch implementation of a non-autoregressive Transformer with unsupervised duration learning, based on Transformer and Conformer blocks, with support for the Vietnamese language.

    Language: Python
  • aistairc/BERT-NAR-BERT

    BERT-based pre-trained non-autoregressive sequence-to-sequence model

    Language: Python
  • LARC-CMU-SMU/Enconter

    Implementation of the EACL 2021 paper Enconter

    Language: Jupyter Notebook
  • mahshid1378/Parallel-Tacotron2

    PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

    Language: Python