CSLJingyu's Stars
median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
xuchennlp/S2T
The project for speech translation
burchim/EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
tonyduan/transformer-blocks
Multi-Head Attention, Transformer, Perceiver, Linear Attention.
theodorblackbird/lina-speech
Official implementation of the TTS model Lina-Speech
lironui/Linear-Attention-Mechanism
Attention mechanism
kuixu/Linear-Multihead-Attention
Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)
ShigekiKarita/espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tree/karita-asrtts for newer code in ICASSP2019 Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
DCGM/SoftCTC
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
nvidia-riva/riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
scufan1990/Key-Frame-Mechanism-For-Efficient-Conformer
code
phecda-xu/wav2vec-project
Study of fairseq's wav2vec
lucidrains/FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
radarFudan/Awesome-state-space-models
Collection of papers on state-space models
state-spaces/s4
Structured state space sequence models
state-spaces/mamba
Mamba SSM architecture
CSLJingyu/pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
idiap/hypermixing
PyTorch implementation for HyperMixing, a linear-time token-mixing technique used in HyperMixer architecture
tuanio/nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
C-Fun/Self-Attentive-Pooling-for-Efficient-Deep-Learning
Official PyTorch implementation of the paper entitled 'Self Attentive Pooling for Efficient Deep Learning'.
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
ASR-studio/LAS-ASR
LAS自定义数据集训练语音识别模型
wttu/dlbeginners
神经网络与深度学习
seungjunlee96/Depthwise-Separable-Convolution_Pytorch
Implementation of Depthwise Separable Convolution (pytorch)
langgptai/wonderful-prompts
🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀
jwr1995/DTCN
jwr1995/dc1d
A 1D implementation of a deformable convolutional layer in PyTorch with a few tricks.