Awesome-Mamba

Awesome works based on SSM and Mamba

SSMs

Blog

Post

Title Year Venue Code PDF
The pitfalls of next-token prediction 2024 arXiv code pdf
The Hidden Attention of Mamba Models 2024 arXiv code pdf
Theoretical Foundations of Deep Selective State-Space Models 2024 arXiv code pdf
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling 2024 arXiv code pdf
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data 2024 arXiv code pdf
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks 2024 arXiv code pdf
Is Mamba Capable of In-Context Learning? 2024 arXiv code pdf
BlackMamba: Mixture of Experts for State-Space Models 2024 arXiv code pdf
Mamba: Linear-Time Sequence Modeling with Selective State Spaces 2023 arXiv code pdf
Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors 2023 arXiv code pdf
What Makes Convolutional Models Great on Long Sequence Modeling? 2023 ICLR 2023 code pdf
Mega: Moving Average Equipped Gated Attention 2023 ICLR 2023 code pdf
Liquid Structural State-Space Models 2023 ICLR 2023 code pdf
Hungry Hungry Hippos: Towards Language Modeling with State Space Models 2022 ICLR 2023 code pdf
On the Parameterization and Initialization of Diagonal State Space Models 2022 NeurIPS 2022 code pdf
S4ND: Modeling Images and Videos as Multidimensional Signals Using State Spaces 2022 NeurIPS 2022 code pdf
Long Range Language Modeling via Gated State Spaces 2022 ICLR 2023 code pdf
Simplified State Space Layers for Sequence Modeling 2022 ICLR 2023 code pdf
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections 2022 ICLR 2023 code pdf
Diagonal State Spaces are as Effective as Structured State Spaces 2022 NeurIPS 2022 - Spotlight code pdf
Efficiently Modeling Long Sequences with Structured State Spaces 2021 ICLR 2022 - Outstanding Paper HM code pdf
Combining Recurrent, Convolutional, and Continuous-time Models with the Linear State Space Layer 2021 NeurIPS 2021 code pdf
HiPPO: Recurrent Memory with Optimal Polynomial Projections 2020 NeurIPS 2020 - Spotlight code pdf

Some Analysis and Discussion

Title Year Venue Code PDF
Does Transformer Interpretability Transfer to RNNs? 2024 arXiv code pdf
Locating and Editing Factual Associations in Mamba 2024 arXiv code pdf

Multimodal understanding

Title Year Venue Code PDF
SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding 2024 arXiv code pdf
ReMamber: Referring Image Segmentation with Mamba Twister 2024 arXiv code pdf
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference 2024 arXiv code pdf
VL-Mamba: Exploring State Space Models for Multimodal Learning 2024 arXiv code pdf

Neural rendering

Title Year Venue Code PDF
3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering 2024 arXiv code pdf
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction 2024 arXiv code pdf

Time Series

Title Year Venue Code PDF
RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos 2024 arXiv code pdf
HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM 2024 arXiv code pdf
Uncovering Selective State Space Model’s Capabilities in Lifelong Sequential Recommendation 2024 arXiv code pdf
TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting 2024 arXiv code pdf
Is Mamba Effective for Time Series Forecasting? 2024 arXiv code pdf

Audio generation

Title Year Venue Code PDF
SPMamba: State-space model is all you need in speech separation 2024 arXiv code pdf
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation 2024 arXiv code pdf
Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers 2024 arXiv code pdf
MambaLithium: Selective state space model for remaining-useful-life, state-of-health, and state-of-charge estimation of lithium-ion batteries 2024 arXiv code pdf
MambaStock: Selective state space model for stock prediction 2024 arXiv code pdf
It's Raw! Audio Generation with State-Space Models 2022 ICML 2022 - Long Talk code pdf

NLP

Title Year Venue Code PDF
ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes 2024 arXiv code pdf
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models 2024 arXiv code pdf
MambaByte: Token-free Selective State Space Model 2024 arXiv code pdf
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts 2024 arXiv code pdf

Graph Neural Network

Title Year Venue Code PDF
STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model 2024 arXiv code pdf
Graph Mamba: Towards Learning on Graphs with State Space Models 2024 arXiv code pdf

Computer Vision

Title Year Venue Code PDF
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation 2024 arXiv code pdf
ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model 2024 arXiv code pdf
RS-Mamba for Large Remote Sensing Image Dense Prediction 2024 arXiv code pdf
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation 2024 arXiv code pdf
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model 2024 arXiv code pdf
RSMamba: Remote Sensing Image Classification with State Space Model 2024 arXiv code pdf
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition 2024 arXiv code pdf
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting 2024 arXiv code pdf
LocalMamba: Visual State Space Model with Windowed Selective Scan 2024 arXiv code pdf
VideoMamba: State Space Model for Efficient Video Understanding 2024 arXiv code pdf
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding 2024 arXiv code pdf
EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba 2024 arXiv code pdf
On the low-shot transferability of [V]-Mamba 2024 arXiv code pdf
MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models 2024 arXiv code pdf
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection 2024 arXiv code pdf
MambaIR: A Simple Baseline for Image Restoration with State-Space Model 2024 arXiv code pdf
Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning 2024 arXiv code pdf
Pan-Mamba: Effective pan-sharpening with State Space Model 2024 arXiv code pdf
U-shaped Vision Mamba for Single Image Dehazing 2024 arXiv code pdf
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model 2024 arXiv code pdf

Diffusion Model

Title Year Venue Code PDF
ZigMa: A DiT-style Zigzag Mamba Diffusion Model 2024 arXiv code pdf

Super-Resolution

Title Year Venue Code PDF
Activating Wider Areas in Image Super-Resolution 2024 arXiv code pdf

Digital human

Title Year Venue Code PDF
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models 2024 arXiv code pdf

Human Pose Estimation

Title Year Venue Code PDF
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM 2024 arXiv code pdf

Foundation Model / Model Framework

Title Year Venue Code PDF
Jamba: A Hybrid Transformer-Mamba Language Model 2024 arXiv code pdf
State Space Models as Foundation Models: A Control Theoretic Overview 2024 arXiv code pdf
SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series 2024 arXiv code pdf
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining 2024 arXiv code pdf

Medical Image Analysis

Title Year Venue Code PDF
MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction 2024 arXiv code pdf
Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention 2024 arXiv code pdf
A multi-cohort study on prediction of acute brain dysfunction states using selective state space models 2024 arXiv code pdf

Multi-Modal Medical Image Analysis

Title Year Venue Code PDF

Medical Image Segmentation

Title Year Venue Code PDF
T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation 2024 arXiv code pdf
UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation 2024 arXiv code pdf
Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation 2024 arXiv code pdf
Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion 2024 arXiv code pdf
ProMamba: Prompt-Mamba for polyp segmentation 2024 arXiv code pdf
H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation 2024 arXiv code pdf
VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation 2024 arXiv code pdf
Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention 2024 arXiv code pdf
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation 2024 arXiv code pdf
Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation 2024 arXiv code pdf
P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation 2024 arXiv code pdf
Semi-Mamba-UNet: Pixel-Level Contrastive Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation 2024 arXiv code pdf
Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation 2024 arXiv code pdf
VM-UNet: Vision Mamba UNet for Medical Image Segmentation 2024 arXiv code pdf
Vivim: a Video Vision Mamba for Medical Video Object Segmentation 2024 arXiv code pdf
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation 2024 arXiv code pdf
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation 2024 arXiv code pdf

Medical Image Classification

Title Year Venue Code PDF
CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification 2024 arXiv code pdf
MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology 2024 arXiv code pdf
MedMamba: Vision Mamba for Medical Image Classification 2024 arXiv code pdf

Medical Image Registration

Title Year Venue Code PDF
VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration 2024 arXiv code pdf
MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration 2024 arXiv code pdf

Others

Title Year Venue Code PDF
Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy 2024 arXiv code pdf
FD-Vision Mamba for Endoscopic Exposure Correction 2024 arXiv code pdf
Mamba4Rec: Towards Efficient Sequential Recommendation with Selective State Space Models 2024 arXiv code pdf
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling 2024 arXiv code pdf

Reinforcement Learning

Title Year Venue Code PDF
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces 2024 arXiv code pdf
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning 2024 ICLR 2024 code pdf

Point Cloud

Title Year Venue Code PDF
Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy 2024 arXiv code pdf
Point Cloud Mamba: Point Cloud Learning via State Space Model 2024 arXiv code pdf
PointMamba: A Simple State Space Model for Point Cloud Analysis 2024 arXiv code pdf

Robotics

Title Year Venue Code PDF
Music to Dance as Language Translation using Sequence Models 2024 arXiv code pdf

Structural Data

Title Year Venue Code PDF
A multi-cohort study on prediction of acute brain dysfunction states using selective state space models 2024 arXiv code pdf
MambaTab: A Simple Yet Effective Approach for Handling Tabular Data 2024 arXiv code pdf

Competitor

Title Year Venue Code PDF
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models 2024 arXiv code pdf
Gated Linear Attention Transformers with Hardware-Efficient Training 2023 arXiv code pdf

Pitfalls

Title Year Venue Code PDF
The pitfalls of next-token prediction 2024 arXiv code pdf