conformer

There are 64 repositories under conformer topic.

  • modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Language:Python12.6k931.5k1.3k
  • PaddlePaddle/PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Language:Python12.2k1882k1.9k
  • wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    Language:Python4.8k931.1k1.2k
  • FireRedTeam/FireRedASR

    Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.

    Language:Python1.3k1656102
  • sooftware/conformer

    [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

    Language:Python1.1k737186
  • TensorSpeech/TensorFlowASR

    :zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

    Language:Python99526212239
  • yeyupiaoling/PPASR

    基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

    Language:Python86512186130
  • yeyupiaoling/MASR

    Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

    Language:Python7041273113
  • sooftware/kospeech

    Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

    Language:Python63220135194
  • eeyhsong/EEG-Conformer

    EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.

    Language:Python61844591
  • liusongxiang/ppg-vc

    PPG-Based Voice Conversion

    Language:Python34593176
  • voicekit-team/T-one

    T-one is a high-performance streaming ASR pipeline for Russian, specialized for the telephony domain.

    Language:Python18316
  • istupakov/onnx-asr

    Automatic Speech Recognition in Python using ONNX models

    Language:Python1169
  • tuanio/noisy-student-training-asr

    Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem

    Language:Python971515
  • hyperion-ml/hyperion

    Python toolkit for speech processing

    Language:Python7114221
  • sooftware/lightning-asr

    Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

    Language:Python47226
  • MinkaiXu/CGCF-ConfGen

    :test_tube: Learning Neural Generative Dynamics for Molecular Conformation Generation (ICLR 2021)

    Language:Python463416
  • Rishit-dagli/Conformer

    An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras

    Language:Python452911
  • TeaPoly/Conformer-Athena

    Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

    Language:Python44118
  • Audio-WestlakeU/SAR-SSL

    A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]

    Language:Python37231
  • VITA-Group/Audio-Lottery

    [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zhangyang Wang

    Language:Python321135
  • 0xallam/Brain-EEG-Emotion-Classifier

    Emotion classification from Brain EEG signals using a hybrid CNN-Transformer model and various ML algorithms.

    Language:Jupyter Notebook29201
  • RDMC

    xiaoruiDong/RDMC

    Reaction Data and Molecular Conformers (RDMC) is a package dealing with reactions, molecules, conformers, majorly in 3D.

    Language:Jupyter Notebook287181
  • jreremy/conformer

    Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.

    Language:Python27124
  • DataXujing/ASR-paper

    :fire: ASR教程: https://dataxujing.github.io/ASR-paper/

  • UnixJunkie/smi2sdf3d

    3D diverse conformers generation using rdkit

    Language:Python2331512
  • msalhab96/Conformer

    An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper

    Language:Python20132
  • manhph2211/ViSTT

    I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and AWS ...

    Language:Python17222
  • ADicksonLab/AGDIFF

    Implementation of AGDIFF: Attention-Enhanced Diffusion for Molecular Geometry Prediction

    Language:Python15230
  • jaketae/conformer

    PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition

    Language:Python1520
  • tuanio/conformer-rnnt

    Conformer RNN-Transducer

    Language:Python14101
  • aidayang/FunASR-OneClick

    FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件

  • lucadellalib/ts-asr

    Target speaker automatic speech recognition (TS-ASR)

    Language:Python11235
  • tuanio/nextformer

    PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"

    Language:Python11202
  • danieleninni/small-footprint-keyword-spotting

    Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting

    Language:Python10102
  • LuluW8071/Conformer

    End-to-End Speech Recognition Training with Conformer CTC using PyTorch Lightning⚡

    Language:Jupyter Notebook10202