NX-AI/vision-lstm

Comparision with VisionLSTM-Out

bio-mlhui opened this issue · 1 comments

Mamba and xLSTM are both RNN models, what is the Imagenet1k-acc if the Lstm transform are removed?

The surrounding components of the Mamba SSM and the mLSTM cells are really similar (up projection -> SSM/mLSTM -> gated MLP -> downprojection -> skip connection) so VisionLSTM-Out would be more or less equivalent to Mamba-Out.