When using Conformer in speech recognition
wszyy opened this issue · 1 comment
wszyy commented
The output of DecoderRNN-T has 4 dimensions. How can it be used to recognize speech? Besides, does the author provide a LAS-style model architecture, such as Conformer encoder, LSTM decoder, and attention?
sooftware commented
Check this project
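
On the 4-dimensional output: assuming the dimensions are (batch, time, label, vocab), as in a standard RNN-T joint network, that tensor is what the transducer loss consumes during training. For recognition you would normally not take it directly, but decode step by step instead. Below is a minimal sketch of greedy transducer decoding in plain Python. The names `rnnt_greedy_decode`, `predict_step`, `joint`, and the toy functions are hypothetical and are not this repository's API, and blank index 0 is an assumption.

```python
from typing import Any, Callable, List, Optional, Sequence, Tuple


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value."""
    return max(range(len(values)), key=values.__getitem__)


def rnnt_greedy_decode(
    encoder_frames: Sequence[Any],
    predict_step: Callable[[Optional[int], Any], Tuple[Any, Any]],
    joint: Callable[[Any, Any], Sequence[float]],
    blank: int = 0,                  # assumption: blank label is index 0
    max_symbols_per_frame: int = 10  # guard against emitting forever on one frame
) -> List[int]:
    """Greedy decoding for one utterance.

    encoder_frames: per-frame encoder outputs (e.g. from a Conformer encoder).
    predict_step:   hypothetical wrapper around the label decoder; takes the
                    last emitted token (None at the start) and a state.
    joint:          hypothetical wrapper around the joint network; returns
                    vocabulary scores for one (frame, decoder output) pair.
    """
    tokens: List[int] = []
    pred_out, state = predict_step(None, None)  # start-of-sequence step
    for frame in encoder_frames:
        for _ in range(max_symbols_per_frame):
            best = argmax(joint(frame, pred_out))
            if best == blank:
                break  # blank means: move on to the next encoder frame
            tokens.append(best)
            pred_out, state = predict_step(best, state)
    return tokens


# Toy stand-ins so the sketch runs without a trained model.
def toy_predict_step(token: Optional[int], state: Any) -> Tuple[Any, Any]:
    return token, state  # the "decoder output" is just the last token


def toy_joint(frame: int, pred_out: Optional[int]) -> List[float]:
    scores = [0.0, 0.0, 0.0, 0.0]  # vocabulary of 4, index 0 is blank
    target = frame if (frame != 0 and pred_out != frame) else 0
    scores[target] = 1.0
    return scores


decoded = rnnt_greedy_decode([1, 0, 2, 2, 3], toy_predict_step, toy_joint)
print(decoded)
```

With a real model, `predict_step` and `joint` would wrap the decoder and joint network, and the returned token ids would be mapped back to characters or subwords with the vocabulary.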