conformer

There are 53 repositories under the conformer topic.

  • PaddlePaddle/PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Language: Python
  • modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Language: Python
  • wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    Language: Python
  • sooftware/conformer

    [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

    Language: Python
  • TensorSpeech/TensorFlowASR

    TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can be written with characters or subwords.

    Language: Python
  • yeyupiaoling/PPASR

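    End-to-end Chinese speech recognition implemented with PaddlePaddle, from beginner level to hands-on practice, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer.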

    Language: Python
  • yeyupiaoling/MASR

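    A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. Currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with a variety of data augmentation methods.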

    Language: Python
  • sooftware/kospeech

    Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

    Language: Python
  • eeyhsong/EEG-Conformer

    EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.

    Language: Python
  • liusongxiang/ppg-vc

    PPG-Based Voice Conversion

    Language: Python
  • tuanio/noisy-student-training-asr

    PyTorch implementation of Noisy Student Training for the Automatic Speech Recognition and Automatic Pronunciation Error Detection problems

    Language: Python
  • hyperion-ml/hyperion

    Python toolkit for speech processing

    Language: Python
  • MinkaiXu/CGCF-ConfGen

    Learning Neural Generative Dynamics for Molecular Conformation Generation (ICLR 2021)

    Language: Python
  • sooftware/lightning-asr

    Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

    Language: Python
  • TeaPoly/Conformer-Athena

    Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

    Language: Python
  • Rishit-dagli/Conformer

    An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras

    Language: Python
  • Audio-WestlakeU/SAR-SSL

    A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]

    Language: Python
  • VITA-Group/Audio-Lottery

    [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zhangyang Wang

    Language: Python
  • jreremy/conformer

    PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.

    Language: Python
  • xiaoruiDong/RDMC

    Reaction Data and Molecular Conformers (RDMC) is a package for working with reactions, molecules, and conformers, mainly in 3D.

    Language: Jupyter Notebook
  • DataXujing/ASR-paper

    ASR tutorial: https://dataxujing.github.io/ASR-paper/

  • UnixJunkie/smi2sdf3d

    3D diverse conformers generation using rdkit

    Language: Python
  • ahmed-alllam/Brain-EEG-Emotion-Classifier

    Emotion classification from Brain EEG signals using a hybrid CNN-Transformer model and various ML algorithms.

    Language: Jupyter Notebook
  • manhph2211/ViSTT

    I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and AWS ...

    Language: Python
  • msalhab96/Conformer

    An implementation of the paper "Conformer: Convolution-augmented Transformer for Speech Recognition"

    Language: Python
  • jaketae/conformer

    PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition

    Language: Python
  • tuanio/conformer-rnnt

    Conformer RNN-Transducer

    Language: Python
  • tuanio/nextformer

    PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"

    Language: Python
  • lucadellalib/ts-asr

    Target speaker automatic speech recognition (TS-ASR)

    Language: Python
  • danieleninni/small-footprint-keyword-spotting

    Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting

    Language: Python
  • hoangtuanvu/conformer_ocr

    Transformer OCR is an Optical Character Recognition toolkit built for researchers working on OCR for both Vietnamese and English. This project focuses only on variants of the vanilla Transformer (Conformer) and on feature extraction (CNN-based approach).

    Language: Python
  • ADicksonLab/AGDIFF

    Implementation of AGDIFF: Attention-Enhanced Diffusion for Molecular Geometry Prediction

    Language: Python
  • tuanio/asr-toolkit

    E2E Speech Recognition Toolkit with Hydra and PyTorch Lightning

    Language: Python
  • LENSS/EMSAssist

    This is the official artifact for the EMSAssist paper at MobiSys'23. EMSAssist: An End-to-End Mobile Voice Assistant at the Edge for Emergency Medical Services

    Language: Python
  • PeaWagon/Kaplan

    Conformer searching package.

    Language: TeX
  • ThomasJewson/Molecular3DLengthDescriptors

    A 3D conformation-based molecular descriptor set for use in QSPR and Machine Learning.

    Language: Python