/Conformer

Primary LanguagePythonApache License 2.0Apache-2.0

Conformer

PyTorch Implementation: Convolution-augmented Transformer for Speech Recognition

License Framework

Working now...

Introduction

This is an ASR model called Conformer made by Google.
This paper introduces only encoder models. However, I implemented both encoder and decoder model using PyTorch.
Encoder was implemented as conformer according to the paper, and decoder was implemented as 'Something'. (Decoder has not been decided which model to use)

Conformer Architecture

Conformer Encoder Architecture
description

Feed Forward Module

Feed Forward Module
description

Multi-Head Self Attention Module

Multi-Head Self Attention Module
description

Convolution Module

Convolution Module
description

Installation

Usage

Train

from conformer.trainer import Trainer

Trainer().fit(...)

Evaluation

from conformer.predictor import Predictor

Predictor(model_path='path/to/').eval(...)

Reference

Paper

Github