Transformer are RNNs: Fast Autoregressive Transformer with Linear Attention
kduxin/Linear-Transformer
Transformer are RNNs: Fast Autoregressive Transformer with Linear Attention
Python
Transformer are RNNs: Fast Autoregressive Transformer with Linear Attention
Python