CSLJingyu

CSLJingyu's Stars

median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
Language: Python · 2.1k stars · 201 forks
xuchennlp/S2T
The project for speech translation
Language:Python112
burchim/EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
Language:Python21431
tonyduan/transformer-blocks
Multi-Head Attention, Transformer, Perceiver, Linear Attention.
Language:Python102
theodorblackbird/lina-speech
Official implementation of the TTS model Lina-Speech
Language:Jupyter Notebook14512
lironui/Linear-Attention-Mechanism
Attention mechanism
Language:Python5412
kuixu/Linear-Multihead-Attention
Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)
Language:Python7214
ShigekiKarita/espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tree/karita-asrtts for newer code in ICASSP2019 Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders
Language:Python3812
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Language: Cuda · 1.1k stars · 217 forks
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language: Python · 11.7k stars · 1.9k forks
DCGM/SoftCTC
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
Language:Python191
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python · 12.7k stars · 2.6k forks
nvidia-riva/riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
Language:Python8323
scufan1990/Key-Frame-Mechanism-For-Efficient-Conformer
code
Language:Python31
phecda-xu/wav2vec-project
Study of fairseq's wav2vec
Language:Python71
lucidrains/FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
Language:Python35424
radarFudan/Awesome-state-space-models
Collection of papers on state-space models
56720
state-spaces/s4
Structured state space sequence models
Language: Jupyter Notebook · 2.5k stars · 302 forks
state-spaces/mamba
Mamba SSM architecture
Language: Python · 13.7k stars · 1.2k forks
CSLJingyu/pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
1
idiap/hypermixing
PyTorch implementation for HyperMixing, a linear-time token-mixing technique used in HyperMixer architecture
Language:Python212
tuanio/nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
Language:Python122
C-Fun/Self-Attentive-Pooling-for-Efficient-Deep-Learning
Official PyTorch implementation of the paper entitled 'Self Attentive Pooling for Efficient Deep Learning'.
Language:Python10
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Language: Python · 30.5k stars · 8.2k forks
ASR-studio/LAS-ASR
Training a speech recognition model with LAS on a custom dataset
Language:Python2
wttu/dlbeginners
Neural Networks and Deep Learning
Language:Jupyter Notebook225
seungjunlee96/Depthwise-Separable-Convolution_Pytorch
Implementation of Depthwise Separable Convolution (pytorch)
Language:Python717
langgptai/wonderful-prompts
🔥 Curated Chinese prompts 🔥, a ChatGPT usage guide to make ChatGPT more fun and more useful! 🚀
3.6k stars · 311 forks
jwr1995/DTCN
Language:Python142
jwr1995/dc1d
A 1D implementation of a deformable convolutional layer in PyTorch with a few tricks.
Language:Python404