Awsome works based on SSM and Mamba
- A Visual Guide to Mamba and State Space Models (MAARTEN GROOTENDORST)
- Introduction to State Space Models (SSM) (lbourdois Loïck BOURDOIS)
- H3: Language Modeling with State Space Models and (Almost) No Attention
- The Annotated S4
- Structured State Spaces for Sequence Modeling (S4) part 1
- Structured State Spaces for Sequence Modeling (S4) part 2
- Structured State Spaces for Sequence Modeling (S4) part 3
- HiPPO: Recurrent Memory with Optimal Polynomial Projections
Title | Year | Venue | Code | |
---|---|---|---|---|
Does Transformer Interpretability Transfer to RNNs? | 2024 | arXiv | code | |
Locating and Editing Factual Associations in Mamba | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding | 2024 | arXiv | code | |
ReMamber: Referring Image Segmentation with Mamba Twister | 2024 | arXiv | code | |
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | 2024 | arXiv | code | |
VL-Mamba: Exploring State Space Models for Multimodal Learning | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering | 2024 | arXiv | code | |
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos | 2024 | arXiv | code | |
HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM | 2024 | arXiv | code | |
Uncovering Selective State Space Model’s Capabilities in Lifelong Sequential Recommendation | 2024 | arXiv | code | |
TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting | 2024 | arXiv | code | |
Is Mamba Effective for Time Series Forecasting? | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
SPMamba: State-space model is all you need in speech separation | 2024 | arXiv | code | |
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation | 2024 | arXiv | code | |
Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers | 2024 | arXiv | code | |
MambaLithium: Selective state space model for remaining-useful-life, state-of-health, and state-of-charge estimation of lithium-ion batteries | 2024 | arXiv | code | |
MambaStock: Selective state space model for stock prediction | 2024 | arXiv | code | |
It's Raw! Audio Generation with State-Space Models | 2022 | ICML 2022 - Long Talk | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes | 2024 | arXiv | code | |
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models | 2024 | arXiv | code | |
MambaByte: Token-free Selective State Space Model | 2024 | arXiv | code | |
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model | 2024 | arXiv | code | |
Graph Mamba: Towards Learning on Graphs with State Space Models | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
ZigMa: A DiT-style Zigzag Mamba Diffusion Model | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Activating Wider Areas in Image Super-Resolution | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Jamba: A Hybrid Transformer-Mamba Language Model | 2024 | arXiv | code | |
State Space Models as Foundation Models: A Control Theoretic Overview | 2024 | arXiv | code | |
SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series | 2024 | arXiv | code | |
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction | 2024 | arXiv | code | |
Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention | 2024 | arXiv | code | |
A multi-cohort study on prediction of acute brain dysfunction states using selective state space models | 2024 | arXiv | code | |
MD-Dose: A diffusion model based on the Mamba for radiation dose prediction | 2024 | arXiv | code |
Title | Year | Venue | Code |
---|
Title | Year | Venue | Code | |
---|---|---|---|---|
CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification | 2024 | arXiv | code | |
MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology | 2024 | arXiv | code | |
MedMamba: Vision Mamba for Medical Image Classification | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration | 2024 | arXiv | code | |
MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy | 2024 | arXiv | code | |
FD-Vision Mamba for Endoscopic Exposure Correction | 2024 | arXiv | code | |
Mamba4Rec: Towards Efficient Sequential Recommendation with Selective State Space Models | 2024 | arXiv | code | |
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling | 2024 | arXiv | code |
Tital | Year | Venue | Code | |
---|---|---|---|---|
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | 2024 | arXiv | code | |
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning | 2024 | ICLR 2024 | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy | 2024 | arXiv | code | |
Point Could Mamba: Point Cloud Learning via State Space Model | 2024 | arXiv | code | |
PointMamba: A Simple State Space Model for Point Cloud Analysis | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Music to Dance as Language Translation using Sequence Models | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
A multi-cohort study on prediction of acute brain dysfunction states using selective state space models | 2024 | arXiv | code | |
MambaTab: A Simple Yet Effective Approach for Handling Tabular Data | 2024 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models | 2024 | arXiv | code | |
Gated Linear Attention Transformers with Hardware-Efficient Training | 2023 | arXiv | code |
Title | Year | Venue | Code | |
---|---|---|---|---|
The pitfalls of next-token prediction | 2024 | arXiv | code |