Vision Mamba: A Comprehensive Survey and Taxonomy

Abstract: A State Space Model (SSM) is a mathematical model used to describe and analyze the behavior of dynamic systems. SSMs have found numerous applications in several fields, including control theory, signal processing, economics and machine learning. In deep learning, state space models are used to process sequence data in tasks such as time series analysis, natural language processing (NLP) and video understanding. By mapping sequence data to a state space, long-term dependencies in the data can be better captured. In particular, modern SSMs have shown strong representational capabilities in NLP, especially in long sequence modeling, while maintaining linear time complexity. Notably, building on the latest state space models, Mamba \cite{Mamba} merges time-varying parameters into SSMs and formulates a hardware-aware algorithm for efficient training and inference. Given its impressive efficiency and strong long-range dependency modeling capability, Mamba is expected to become a new AI architecture that may outperform the Transformer. Recently, a number of works have studied the potential of Mamba in various fields, such as general vision, multi-modal learning, medical image analysis and remote sensing image analysis, by extending Mamba from the natural language domain to the visual domain. To fully understand Mamba in the visual domain, we conduct a comprehensive survey and present a taxonomy study. This survey focuses on Mamba's application to a variety of visual tasks and data types, and discusses its predecessors, recent advances and far-reaching impact on a wide range of domains. Since Mamba is on an upward trend, please let us know if you have new findings; new progress on Mamba will be included in this survey in a timely manner and updated on the website: (https://github.com/lx6c78/Vision-Mamba-A-Comprehensive-Survey-and-Taxonomy).
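
To make the "time-varying parameters" and "linear time complexity" mentioned in the abstract concrete, here is a minimal sketch of a selective state space recurrence over a 1-D sequence. It is an illustration under stated assumptions, not Mamba's implementation: the function name `selective_scan` and the weights `w_b`, `w_c` and `w_dt` are hypothetical, the state matrix is assumed diagonal, the input-dependent parameters are plain scalar multiples of the input, and the loop is a sequential scan rather than the hardware-aware parallel algorithm.

```python
import math


def selective_scan(x, a, w_b, w_c, w_dt):
    """Toy selective SSM: one pass over the sequence, O(len(x) * len(a)) work.

    x    : list of floats, the input sequence
    a    : list of negative floats, diagonal of the continuous state matrix
    w_b  : list of floats, hypothetical weights giving B_t = w_b * x_t
    w_c  : list of floats, hypothetical weights giving C_t = w_c * x_t
    w_dt : float, hypothetical weight giving step size dt_t = softplus(w_dt * x_t)
    """
    n = len(a)
    h = [0.0] * n  # hidden state, carried across time steps
    y = []
    for x_t in x:
        # Input-dependent (time-varying) step size; softplus keeps it positive.
        dt = math.log1p(math.exp(w_dt * x_t))
        # Discretize: exp(dt * a) for the state matrix, dt * B_t for the input
        # matrix (a simplified choice assumed for this sketch).
        h = [
            math.exp(dt * a[i]) * h[i] + dt * (w_b[i] * x_t) * x_t
            for i in range(n)
        ]
        # Input-dependent readout y_t = C_t . h_t
        y.append(sum(w_c[i] * x_t * h[i] for i in range(n)))
    return y


if __name__ == "__main__":
    out = selective_scan([0.5, -1.0, 2.0, 0.3], [-1.0, -0.5], [1.0, 0.5], [0.2, 1.0], 0.7)
    print(len(out))
```

Each output depends only on the current input and the carried state, so the cost grows linearly with sequence length, and because `dt`, `B_t` and `C_t` are recomputed from every input, the recurrence can decide per step how much of the past to retain.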

We will promptly update this page with the latest representative literature and released source code. If you have any questions, please don't hesitate to contact us at any of the following emails: liuxiao@stu.cqu.edu.cn, zhangchenxu@cqu.edu.cn, leizhang@cqu.edu.cn

📢 Update Log

  • 2024.05.07: Our paper is released! [arXiv]
  • 2024.05.18: Added "Latest Visual Mamba Papers" column. We plan to update these papers in subsequent versions of our survey.

Citation

If you find this repository useful, please cite our paper:

@misc{liu2024vision,
      title={Vision Mamba: A Comprehensive Survey and Taxonomy}, 
      author={Xiao Liu and Chenxu Zhang and Lei Zhang},
      year={2024},
      eprint={2405.04404},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contents

Related Surveys

  • State Space Model for New-Generation Network Alternative to Transformers: A Survey. [15 April, 2024] [ArXiv, 2024]
    Xiao Wang, Shiao Wang, Yuhe Ding, Yuehang Li, Wentao Wu, Yao Rong, Weizhe Kong, Ju Huang, Shihao Li, Haoxiang Yang, Ziwen Wang, Bowei Jiang, Chenglong Li, Yaowei Wang, Yonghong Tian, Jin Tang.
    [Paper] [Github]
  • A Survey on Visual Mamba. [26 April, 2024] [ArXiv, 2024]
    Hanwei Zhang, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Zi Ye.
    [Paper]
  • Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges. [24 April, 2024] [ArXiv, 2024]
    Badri Narayana Patro, Vijay Srinivas Agneeswaran.
    [Paper] [Github]
  • A Survey on Vision Mamba: Models, Applications and Challenges. [29 April, 2024] [ArXiv, 2024]
    Rui Xu, Shu Yang, Yihui Wang, Bo Du, Hao Chen.
    [Paper] [Github]
  • Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis. [5 June, 2024] [ArXiv, 2024]
    Moein Heidari, Sina Ghorbani Kolahi, Sanaz Karimijafarbigloo, Bobby Azad, Afshin Bozorgpour, Soheila Hatami, Reza Azad, Ali Diba, Ulas Bagci, Dorit Merhof, Ilker Hacihaliloglu.
    [Paper] [Github]

Latest Visual Mamba Papers

We plan to update these papers in subsequent versions of our survey.

  • CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation. [30 April, 2024] [ArXiv, 2024]
    Weiquan Huang, Yifei Shen, Yifan Yang.
    [Paper] [Code]
  • SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients. [5 May, 2024] [ArXiv, 2024]
    Tushar Verma, Jyotsna Singh, Yash Bhartari, Rishi Jarwal, Suraj Singh, Shubhkarman Singh.
    [Paper] [Code]
  • SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising. [15 May, 2024] [ArXiv, 2024]
    Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Yuntao Qian.
    [Paper] [Code]
  • FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space. [9 May, 2024] [ArXiv, 2024]
    Hui Ma, Sen Lei, Turgay Celik, Heng-Chao Li.
    [Paper] [Code]
  • DVMSR: Distillated Vision Mamba for Efficient Super-Resolution. [11 May, 2024] [ArXiv, 2024]
    Xiaoyan Lei, Wenlong Zhang, Weifeng Cao.
    [Paper] [Code]
  • AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation. [5 May, 2024] [ArXiv, 2024]
    Viet-Thanh Nguyen, Van-Truong Pham, Thi-Thao Tran.
    [Paper] [Code]
  • Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement. [6 May, 2024] [ArXiv, 2024]
    Jiesong Bai, Yuhao Yin, Qiyuan He.
    [Paper] [Code]
  • VMambaCC: A Visual State Space Model for Crowd Counting. [6 May, 2024] [ArXiv, 2024]
    Hao-Yuan Ma, Li Zhang, Shuai Shi.
    [Paper]
  • Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models. [8 May, 2024] [ArXiv, 2024]
    Zhengxing Lan, Hongbo Li, Lingshan Liu, Bo Fan, Yisheng Lv, Yilong Ren, Zhiyong Cui.
    [Paper]
  • Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution. [8 May, 2024] [ArXiv, 2024]
    Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin.
    [Paper]
  • HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation. [11 May, 2024] [ArXiv, 2024]
    Jiashu Xu.
    [Paper]
  • VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis. [9 May, 2024] [ArXiv, 2024]
    Zhihan Ju, Wanting Zhou.
    [Paper]
  • Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba. [9 May, 2024] [ArXiv, 2024]
    Hongwei Ren, Yue Zhou, Jiadong Zhu, Haotian Fu, Yulong Huang, Xiaopeng Lin, Yuetong Fang, Fei Ma, Hao Yu, Bojun Cheng.
    [Paper]
  • GMSR: Gradient-Guided Mamba for Spectral Reconstruction from RGB Images. [13 May, 2024] [ArXiv, 2024]
    Xinying Wang, Zhixiong Huang, Sifan Zhang, Jiawen Zhu, Lin Feng.
    [Paper] [Code]
  • OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition. [13 May, 2024] [ArXiv, 2024]
    Qiuchi Xiang, Jintao Cheng, Jiehao Luo, Jin Wu, Rui Fan, Xieyuanli Chen, Xiaoyu Tang.
    [Paper]
  • MambaOut: Do We Really Need Mamba for Vision? [14 May, 2024] [ArXiv, 2024]
    Weihao Yu, Xinchao Wang.
    [Paper] [Code]
  • Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study. [14 May, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuan Fang, Yuanzhi Cai, Cheng Chen, Lei Fan.
    [Paper]
  • IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model. [16 May, 2024] [ArXiv, 2024]
    Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Shinichiro Omachi.
    [Paper] [Code]
  • RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing. [16 May, 2024] [ArXiv, 2024]
    Huiling Zhou, Xianhao Wu, Hongming Chen, Xiang Chen, Xin He.
    [Paper]
  • CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation. [17 May, 2024] [ArXiv, 2024]
    Mushui Liu, Jun Dan, Ziqian Lu, Yunlong Yu, Yingming Li, Xi Li.
    [Paper] [Code]
  • Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification. [25 May, 2024] [ArXiv, 2024]
    Weilian Zhou, Sei-Ichiro Kamata, Haipeng Wang, Man-Sing Wong, Huiying Hou.
    [Paper] [Code]
  • 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification. [21 May, 2024] [ArXiv, 2024]
    Yan He, Bing Tu, Bo Liu, Jun Li, Antonio Plaza.
    [Paper]
  • I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling. [22 May, 2024] [ArXiv, 2024]
    Omer F. Atli, Bilal Kabas, Fuat Arslan, Mahmut Yurt, Onat Dalmaz, Tolga Çukur.
    [Paper] [Code]
  • Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model. [23 May, 2024] [ArXiv, 2024]
    Yuheng Shi, Minjing Dong, Chang Xu.
    [Paper] [Code]
  • DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis. [23 May, 2024] [ArXiv, 2024]
    Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu.
    [Paper] [Code]
  • MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models. [23 May, 2024] [ArXiv, 2024]
    Jiuming Liu, Jinru Han, Lihao Liu, Angelica I. Aviles-Rivero, Chaokang Jiang, Zhe Liu, Hesheng Wang.
    [Paper]
  • Scalable Visual State Space Model with Fractal Scanning. [26 May, 2024] [ArXiv, 2024]
    Lv Tang, HaoKe Xiao, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li.
    [Paper]
  • Mamba-R: Vision Mamba ALSO Needs Registers. [23 May, 2024] [ArXiv, 2024]
    Feng Wang, Jiahao Wang, Sucheng Ren, Guoyizhe Wei, Jieru Mei, Wei Shao, Yuyin Zhou, Alan Yuille, Cihang Xie.
    [Paper] [Homepage] [Code]
  • PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning. [24 May, 2024] [ArXiv, 2024]
    Qingdong He, Jiangning Zhang, Jinlong Peng, Haoyang He, Yabiao Wang, Chengjie Wang.
    [Paper] [Code]
  • PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis. [24 May, 2024] [ArXiv, 2024]
    Zicheng Wang, Zhenghao Chen, Yiming Wu, Zhen Zhao, Luping Zhou, Dong Xu.
    [Paper] [Code]
  • Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models. [27 May, 2024] [ArXiv, 2024]
    Byung-Kwan Lee, Chae Won Kim, Beomchan Park, Yong Man Ro.
    [Paper] [Code]
  • Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation. [24 May, 2024] [ArXiv, 2024]
    Shentong Mo, Yapeng Tian.
    [Paper]
  • MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation. [24 May, 2024] [ArXiv, 2024]
    Chunyu Yuan, Dongfang Zhao, Sos S. Agaian.
    [Paper] [Code]
  • Demystify Mamba in Vision: A Linear Attention Perspective. [26 May, 2024] [ArXiv, 2024]
    Dongchen Han, Ziyi Wang, Zhuofan Xia, Yizeng Han, Yifan Pu, Chunjiang Ge, Jun Song, Shiji Song, Bo Zheng, Gao Huang.
    [Paper] [Code]
  • TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction. [27 May, 2024] [ArXiv, 2024]
    Yinda Chen, Haoyuan Shi, Xiaoyu Liu, Te Shi, Ruobing Zhang, Dong Liu, Zhiwei Xiong, Feng Wu.
    [Paper] [Code]
  • LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling. [27 May, 2024] [ArXiv, 2024]
    Yaohua Zha, Naiqi Li, Yanzi Wang, Tao Dai, Hang Guo, Bin Chen, Zhi Wang, Zhihao Ouyang, Shu-Tao Xia.
    [Paper]
  • Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba. [27 May, 2024] [ArXiv, 2024]
    Jiahao Huang, Liutao Yang, Fanwen Wang, Yinzhe Wu, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang.
    [Paper]
  • Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent. [27 May, 2024] [ArXiv, 2024]
    Yi Xu, Yun Fu.
    [Paper] [Code]
  • DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention. [28 May, 2024] [ArXiv, 2024]
    Lianghui Zhu, Zilong Huang, Bencheng Liao, Jun Hao Liew, Hanshu Yan, Jiashi Feng, Xinggang Wang.
    [Paper] [Code]
  • Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba. [28 May, 2024] [ArXiv, 2024]
    Zefan Yang, Jiajin Zhang, Ge Wang, Mannudeep K. Kalra, Pingkun Yan.
    [Paper]
  • Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain. [28 May, 2024] [ArXiv, 2024]
    Juntao Zhang, Kun Bian, Peng Cheng, Wenbo An, Jianning Liu, Jun Zhou.
    [Paper] [Code]
  • FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining. [29 May, 2024] [ArXiv, 2024]
    Dong Li, Yidi Liu, Xueyang Fu, Senyan Xu, Zheng-Jun Zha.
    [Paper]
  • DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark. [30 May, 2024] [ArXiv, 2024]
    Haoxing Chen, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Yaohui Li, Jun Lan, Huijia Zhu, Jianfu Zhang, Weiqiang Wang, Huaxiong Li.
    [Paper] [Code]
  • Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging. [1 June, 2024] [ArXiv, 2024]
    Jiahua Dong, Hui Yin, Hongliu Li, Wenbo Li, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan.
    [Paper] [Code]
  • MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging. [2 June, 2024] [ArXiv, 2024]
    Jiaying Zhou, Mingzhou Jiang, Junde Wu, Jiayuan Zhu, Ziyue Wang, Yueming Jin.
    [Paper]
  • LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network. [3 June, 2024] [ArXiv, 2024]
    Xuanqi Zhang, Haijin Zeng, Jinwang Pan, Qiangqiang Shen, Yongyong Chen.
    [Paper]
  • Dimba: Transformer-Mamba Diffusion Models. [3 June, 2024] [ArXiv, 2024]
    Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Youqiang Zhang, Junshi Huang.
    [Paper] [Homepage] [Code]
  • CDMamba: Remote Sensing Image Change Detection with Mamba. [6 June, 2024] [ArXiv, 2024]
    Haotian Zhang, Keyan Chen, Chenyang Liu, Hao Chen, Zhengxia Zou, Zhenwei Shi.
    [Paper] [Code]
  • RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation. [6 June, 2024] [ArXiv, 2024]
    Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang.
    [Paper] [Homepage] [Code]
  • MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation. [6 June, 2024] [ArXiv, 2024]
    Ionuţ Grigore, Călin-Adrian Popa.
    [Paper] [Code]
  • Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs. [7 June, 2024] [ArXiv, 2024]
    Shentong Mo.
    [Paper]
  • HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model. [9 June, 2024] [ArXiv, 2024]
    Hang Fu, Genyun Sun, Yinhe Li, Jinchang Ren, Aizhu Zhang, Cheng Jing, Pedram Ghamisi.
    [Paper] [Code]
  • Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans. [9 June, 2024] [ArXiv, 2024]
    Muthukumar K A, Amit Gurung, Priya Ranjan.
    [Paper]
  • Convolution and Attention-Free Mamba-based Cardiac Image Segmentation. [9 June, 2024] [ArXiv, 2024]
    Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh.
    [Paper]
  • Mamba YOLO: SSMs-Based YOLO For Object Detection. [9 June, 2024] [ArXiv, 2024]
    Zeyu Wang, Chen Li, Huiying Xu, Xinzhong Zhu.
    [Paper] [Code]
  • MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba. [9 June, 2024] [ArXiv, 2024]
    Zhongping Ji.
    [Paper] [Code]
  • PointABM: Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis. [10 June, 2024] [ArXiv, 2024]
    Jia-wei Chen, Yu-jie Xiong, Yong-bin Gao.
    [Paper]
  • DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification. [11 June, 2024] [ArXiv, 2024]
    Jiamu Sheng, Jingyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan.
    [Paper]
  • Autoregressive Pretraining with Mamba in Vision. [11 June, 2024] [ArXiv, 2024]
    Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan Yuille, Cihang Xie.
    [Paper] [Code]
  • PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement. [12 June, 2024] [ArXiv, 2024]
    Wei-Tung Lin, Yong-Xiang Lin, Jyun-Wei Chen, Kai-Lung Hua.
    [Paper] [Code]
  • On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models. [12 June, 2024] [ArXiv, 2024]
    Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan.
    [Paper] [Code]
  • Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment. [13 June, 2024] [ArXiv, 2024]
    Fengbin Guan, Xin Li, Zihao Yu, Yiting Lu, Zhibo Chen.
    [Paper]
  • Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection. [18 June, 2024] [ArXiv, 2024]
    Guowen Zhang, Lue Fan, Chenhang He, Zhen Lei, Zhaoxiang Zhang, Lei Zhang.
    [Paper] [Code]
  • PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery. [16 June, 2024] [ArXiv, 2024]
    Libo Wang, Dongxu Li, Sijun Dong, Xiaoliang Meng, Xiaokang Zhang, Danfeng Hong.
    [Paper] [Code]
  • LFMamba: Light Field Image Super-Resolution with State Space Model. [18 June, 2024] [ArXiv, 2024]
    Wang Xia, Yao Lu, Shunzhou Wang, Ziqi Wang, Peiqi Xia, Tianfei Zhou.
    [Paper]
  • Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images. [20 June, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuanzhi Cai, Lei Fan.
    [Paper] [Code]
  • Soft Masked Mamba Diffusion Model for CT to MRI Conversion. [22 June, 2024] [ArXiv, 2024]
    Zhenbin Wang, Lei Zhang, Lituan Wang, Zhenwei Zhang.
    [Paper] [Code]
  • Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning. [23 June, 2024] [ArXiv, 2024]
    Ruisheng Gao, Zeyu Xiao, Zhiwei Xiong.
    [Paper]
  • Vision Mamba-based autonomous crack segmentation on concrete, asphalt, and masonry surfaces. [24 June, 2024] [ArXiv, 2024]
    Zhaohui Chen, Elyas Asadi Shamsabadi, Sheng Jiang, Luming Shen, Daniel Dias-da-Costa.
    [Paper]
  • Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model. [25 June, 2024] [ArXiv, 2024]
    Zhuoyuan Li, Yubo Ai, Jiahao Lu, ChuXin Wang, Jiacheng Deng, Hanzhi Chang, Yanzhe Liang, Wenfei Yang, Shifeng Zhang, Tianzhu Zhang.
    [Paper]
  • SUM: Saliency Unification through Mamba for Visual Attention Modeling. [25 June, 2024] [ArXiv, 2024]
    Alireza Hosseini, Amirhossein Kazerouni, Saeed Akhavan, Michael Brudno, Babak Taati.
    [Paper] [Code]
  • MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion. [27 June, 2024] [ArXiv, 2024]
    Jing Zou, Lanqing Liu, Qi Chen, Shujun Wang, Xiaohan Xing, Jing Qin.
    [Paper]
  • VideoMambaPro: A Leap Forward for Mamba in Video Understanding. [27 June, 2024] [ArXiv, 2024]
    Hui Lu, Albert Ali Salah, Ronald Poppe.
    [Paper] [Code]
  • Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model. [27 June, 2024] [ArXiv, 2024]
    Haobo Yuan, Xiangtai Li, Lu Qi, Tao Zhang, Ming-Hsuan Yang, Shuicheng Yan, Chen Change Loy.
    [Paper] [Code]

General Vision

1 High-level/Mid-level Vision

1.1 Vision Backbone with Mamba

  • Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model. [10 February, 2024] [ArXiv, 2024]
    Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, Xinggang Wang.
    [Paper] [Code]
  • VMamba: Visual State Space Model. [10 April, 2024] [ArXiv, 2024]
    Yue Liu, Yunjie Tian, Yuzhong Zhao, Hongtian Yu, Lingxi Xie, Yaowei Wang, Qixiang Ye, Yunfan Liu.
    [Paper] [Code]
  • Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data. [19 March, 2024] [ArXiv, 2024]
    Shufan Li, Harkanwar Singh, Aditya Grover.
    [Paper] [Code]
  • LocalMamba: Visual State Space Model with Windowed Selective Scan. [14 March, 2024] [ArXiv, 2024]
    Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu.
    [Paper] [Code]
  • EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba. [14 March, 2024] [ArXiv, 2024]
    Xiaohuan Pei, Tao Huang, Chang Xu.
    [Paper] [Code]
  • SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series. [24 April, 2024] [ArXiv, 2024]
    Badri N. Patro, Vijay S. Agneeswaran.
    [Paper] [Code]
  • PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition. [26 March, 2024] [ArXiv, 2024]
    Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley.
    [Paper] [Code]
  • On the low-shot transferability of [V]-Mamba. [15 March, 2024] [ArXiv, 2024]
    Diganta Misra, Jay Gala, Antonio Orvieto.
    [Paper]
  • DGMamba: Domain Generalization via Generalized State Space Model. [11 April, 2024] [ArXiv, 2024]
    Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan.
    [Paper] [Code]
  • VMambaCC: A Visual State Space Model for Crowd Counting. [6 May, 2024] [ArXiv, 2024]
    Hao-Yuan Ma, Li Zhang, Shuai Shi.
    [Paper]
  • MambaOut: Do We Really Need Mamba for Vision? [14 May, 2024] [ArXiv, 2024]
    Weihao Yu, Xinchao Wang.
    [Paper] [Code]
  • Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model. [23 May, 2024] [ArXiv, 2024]
    Yuheng Shi, Minjing Dong, Chang Xu.
    [Paper] [Code]
  • Mamba-R: Vision Mamba ALSO Needs Registers. [23 May, 2024] [ArXiv, 2024]
    Feng Wang, Jiahao Wang, Sucheng Ren, Guoyizhe Wei, Jieru Mei, Wei Shao, Yuyin Zhou, Alan Yuille, Cihang Xie.
    [Paper] [Homepage] [Code]
  • Demystify Mamba in Vision: A Linear Attention Perspective. [26 May, 2024] [ArXiv, 2024]
    Dongchen Han, Ziyi Wang, Zhuofan Xia, Yizeng Han, Yifan Pu, Chunjiang Ge, Jun Song, Shiji Song, Bo Zheng, Gao Huang.
    [Paper] [Code]
  • Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain. [28 May, 2024] [ArXiv, 2024]
    Juntao Zhang, Kun Bian, Peng Cheng, Wenbo An, Jianning Liu, Jun Zhou.
    [Paper] [Code]
  • Mamba YOLO: SSMs-Based YOLO For Object Detection. [9 June, 2024] [ArXiv, 2024]
    Zeyu Wang, Chen Li, Huiying Xu, Xinzhong Zhu.
    [Paper] [Code]
  • Autoregressive Pretraining with Mamba in Vision. [11 June, 2024] [ArXiv, 2024]
    Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan Yuille, Cihang Xie.
    [Paper] [Code]
  • Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model. [27 June, 2024] [ArXiv, 2024]
    Haobo Yuan, Xiangtai Li, Lu Qi, Tao Zhang, Ming-Hsuan Yang, Shuicheng Yan, Chen Change Loy.
    [Paper] [Code]

1.2 Video Analysis and Understanding

  • VideoMamba: State Space Model for Efficient Video Understanding. [March, 2024] [ArXiv, 2024]
    Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao.
    [Paper] [Code]
  • Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding. [14 March, 2024] [ArXiv, 2024]
    Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang.
    [Paper] [Code]
  • RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos. [9 April, 2024] [ArXiv, 2024]
    Bochao Zou, Zizheng Guo, Xiaocheng Hu, Huimin Ma.
    [Paper] [Code]
  • VideoMambaPro: A Leap Forward for Mamba in Video Understanding. [27 June, 2024] [ArXiv, 2024]
    Hui Lu, Albert Ali Salah, Ronald Poppe.
    [Paper] [Code]
  • DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark. [30 May, 2024] [ArXiv, 2024]
    Haoxing Chen, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Yaohui Li, Jun Lan, Huijia Zhu, Jianfu Zhang, Weiqiang Wang, Huaxiong Li.
    [Paper] [Code]

1.3 Down-stream Visual Applications

  • Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning. [28 April, 2024] [ArXiv, 2024]
    Chi-Sheng Chen, Guan-Ying Chen, Dong Zhou, Di Jiang, Dai-Shi Chen.
    [Paper] [Code]
  • InsectMamba: Insect Pest Classification with State Space Model. [4 April, 2024] [ArXiv, 2024]
    Qianning Wang, Chenglin Wang, Zhixin Lai, Yucheng Zhou.
    [Paper]
  • MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection. [17 March, 2024] [ArXiv, 2024]
    Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu, Yue Wu, Bin Liu, Jieping Ye, Nenghai Yu.
    [Paper] [Code]
  • MemoryMamba: Memory-Augmented State Space Model for Defect Recognition. [6 May, 2024] [ArXiv, 2024]
    Qianning Wang, He Hu, Yucheng Zhou.
    [Paper]
  • SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients. [5 May, 2024] [ArXiv, 2024]
    Tushar Verma, Jyotsna Singh, Yash Bhartari, Rishi Jarwal, Suraj Singh, Shubhkarman Singh.
    [Paper] [Code]
  • FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space. [9 May, 2024] [ArXiv, 2024]
    Hui Ma, Sen Lei, Turgay Celik, Heng-Chao Li.
    [Paper] [Code]
  • OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition. [13 May, 2024] [ArXiv, 2024]
    Qiuchi Xiang, Jintao Cheng, Jiehao Luo, Jin Wu, Rui Fan, Xieyuanli Chen, Xiaoyu Tang.
    [Paper]
  • TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction. [27 May, 2024] [ArXiv, 2024]
    Yinda Chen, Haoyuan Shi, Xiaoyu Liu, Te Shi, Ruobing Zhang, Dong Liu, Zhiwei Xiong, Feng Wu.
    [Paper] [Code]
  • MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation. [6 June, 2024] [ArXiv, 2024]
    Ionuţ Grigore, Călin-Adrian Popa.
    [Paper] [Code]
  • Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment. [13 June, 2024] [ArXiv, 2024]
    Fengbin Guan, Xin Li, Zihao Yu, Yiting Lu, Zhibo Chen.
    [Paper]
  • Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection. [18 June, 2024] [ArXiv, 2024]
    Guowen Zhang, Lue Fan, Chenhang He, Zhen Lei, Zhaoxiang Zhang, Lei Zhang.
    [Paper] [Code]
  • Vision Mamba-based autonomous crack segmentation on concrete, asphalt, and masonry surfaces. [24 June, 2024] [ArXiv, 2024]
    Zhaohui Chen, Elyas Asadi Shamsabadi, Sheng Jiang, Luming Shen, Daniel Dias-da-Costa.
    [Paper]
  • SUM: Saliency Unification through Mamba for Visual Attention Modeling. [25 June, 2024] [ArXiv, 2024]
    Alireza Hosseini, Amirhossein Kazerouni, Saeed Akhavan, Michael Brudno, Babak Taati.
    [Paper] [Code]

2 Low-level Vision

2.1 Image Denoising and Enhancement

  • U-shaped Vision Mamba for Single Image Dehazing. [15 February, 2024] [ArXiv, 2024]
    Zhuoran Zheng, Chen Wu.
    [Paper] [Code]
  • FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining. [15 April, 2024] [ArXiv, 2024]
    Zou Zhen, Yu Hu, Zhao Feng.
    [Paper]
  • Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement. [6 May, 2024] [ArXiv, 2024]
    Jiesong Bai, Yuhao Yin, Qiyuan He.
    [Paper] [Code]
  • HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation. [11 May, 2024] [ArXiv, 2024]
    Jiashu Xu.
    [Paper]
  • FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining. [29 May, 2024] [ArXiv, 2024]
    Dong Li, Yidi Liu, Xueyang Fu, Senyan Xu, Zheng-Jun Zha.
    [Paper]
  • LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network. [3 June, 2024] [ArXiv, 2024]
    Xuanqi Zhang, Haijin Zeng, Jinwang Pan, Qiangqiang Shen, Yongyong Chen.
    [Paper]
  • PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement. [12 June, 2024] [ArXiv, 2024]
    Wei-Tung Lin, Yong-Xiang Lin, Jyun-Wei Chen, Kai-Lung Hua.
    [Paper] [Code]

2.2 Image Restoration

  • MambaIR: A Simple Baseline for Image Restoration with State-Space Model. [25 March, 2024] [ArXiv, 2024]
    Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia.
    [Paper] [Code]
  • Activating Wider Areas in Image Super-Resolution. [13 March, 2024] [ArXiv, 2024]
    Cheng Cheng, Hang Wang, Hongbin Sun.
    [Paper]
  • CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration. [17 April, 2024] [ArXiv, 2024]
    Rui Deng, Tianpei Gu.
    [Paper]
  • VmambaIR: Visual State Space Model for Image Restoration. [17 March, 2024] [ArXiv, 2024]
    Yuan Shi, Bin Xia, Xiaoyu Jin, Xing Wang, Tianyu Zhao, Xin Xia, Xuefeng Xiao, Wenming Yang.
    [Paper] [Code]
  • Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement. [6 May, 2024] [ArXiv, 2024]
    Jiesong Bai, Yuhao Yin, Qiyuan He.
    [Paper] [Code]
  • DVMSR: Distillated Vision Mamba for Efficient Super-Resolution. [11 May, 2024] [ArXiv, 2024]
    Xiaoyan Lei, Wenlong Zhang, Weifeng Cao.
    [Paper] [Code]
  • IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model. [16 May, 2024] [ArXiv, 2024]
    Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Shinichiro Omachi.
    [Paper] [Code]
  • Scalable Visual State Space Model with Fractal Scanning. [26 May, 2024] [ArXiv, 2024]
    Lv Tang, HaoKe Xiao, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li.
    [Paper]
  • GMSR: Gradient-Guided Mamba for Spectral Reconstruction from RGB Images. [13 May, 2024] [ArXiv, 2024]
    Xinying Wang, Zhixiong Huang, Sifan Zhang, Jiawen Zhu, Lin Feng.
    [Paper] [Code]
  • Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging. [1 June, 2024] [ArXiv, 2024]
    Jiahua Dong, Hui Yin, Hongliu Li, Wenbo Li, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan.
    [Paper] [Code]
  • LFMamba: Light Field Image Super-Resolution with State Space Model. [18 June, 2024] [ArXiv, 2024]
    Wang Xia, Yao Lu, Shunzhou Wang, Ziqi Wang, Peiqi Xia, Tianfei Zhou.
    [Paper]
  • Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning. [23 June, 2024] [ArXiv, 2024]
    Ruisheng Gao, Zeyu Xiao, Zhiwei Xiong.
    [Paper]

3 3-D Visual Recognition

3.1 Point Cloud Analysis

  • PointMamba: A Simple State Space Model for Point Cloud Analysis. [2 April, 2024] [ArXiv, 2024]
    Dingkang Liang, Xin Zhou, Xinyu Wang, Xingkui Zhu, Wei Xu, Zhikang Zou, Xiaoqing Ye, Xiang Bai.
    [Paper] [Code]
  • Point Cloud Mamba: Point Cloud Learning via State Space Model. [29 March, 2024] [ArXiv, 2024]
    Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, Shuicheng Yan.
    [Paper] [Code]
  • Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy. [17 March, 2024] [ArXiv, 2024]
    Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang.
    [Paper] [Code]
  • 3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion. [10 April, 2024] [ArXiv, 2024]
    Yixuan Li, Weidong Yang, Ben Fei.
    [Paper]
  • Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba. [9 May, 2024] [ArXiv, 2024]
    Hongwei Ren, Yue Zhou, Jiadong Zhu, Haotian Fu, Yulong Huang, Xiaopeng Lin, Yuetong Fang, Fei Ma, Hao Yu, Bojun Cheng.
    [Paper]
  • MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models. [23 May, 2024] [ArXiv, 2024]
    Jiuming Liu, Jinru Han, Lihao Liu, Angelica I. Aviles-Rivero, Chaokang Jiang, Zhe Liu, Hesheng Wang.
    [Paper]
  • PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning. [24 May, 2024] [ArXiv, 2024]
    Qingdong He, Jiangning Zhang, Jinlong Peng, Haoyang He, Yabiao Wang, Chengjie Wang.
    [Paper] [Code]
  • PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis. [24 May, 2024] [ArXiv, 2024]
    Zicheng Wang, Zhenghao Chen, Yiming Wu, Zhen Zhao, Luping Zhou, Dong Xu.
    [Paper] [Code]
  • LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling. [27 May, 2024] [ArXiv, 2024]
    Yaohua Zha, Naiqi Li, Yanzi Wang, Tao Dai, Hang Guo, Bin Chen, Zhi Wang, Zhihao Ouyang, Shu-Tao Xia.
    [Paper]
  • Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs. [7 June, 2024] [ArXiv, 2024]
    Shentong Mo.
    [Paper]
  • PointABM: Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis. [10 June, 2024] [ArXiv, 2024]
    Jia-wei Chen, Yu-jie Xiong, Yong-bin Gao.
    [Paper]
  • Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model. [25 June, 2024] [ArXiv, 2024]
    Zhuoyuan Li, Yubo Ai, Jiahao Lu, ChuXin Wang, Jiacheng Deng, Hanzhi Chang, Yanzhe Liang, Wenfei Yang, Shifeng Zhang, Tianzhu Zhang.
    [Paper]

3.2 Hyperspectral Imaging Analysis

  • Mamba-FETrack: Frame-Event Tracking via State Space Model. [28 April, 2024] [ArXiv, 2024]
    Ju Huang, Shiao Wang, Shuai Wang, Zhe Wu, Xiao Wang, Bo Jiang.
    [Paper] [Code]
  • 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification. [21 May, 2024] [ArXiv, 2024]
    Yan He, Bing Tu, Bo Liu, Jun Li, Antonio Plaza.
    [Paper]
  • DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification. [11 June, 2024] [ArXiv, 2024]
    Jiamu Sheng, Jingyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan.
    [Paper]

4 Visual Data Generation

  • ZigMa: A DiT-style Zigzag Mamba Diffusion Model. [1 April, 2024] [ArXiv, 2024]
    Vincent Tao Hu, Stefan Andreas Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes Fischer, Björn Ommer.
    [Paper] [Homepage] [Code]
  • Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM. [19 March, 2024] [ArXiv, 2024]
    Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang.
    [Paper] [Homepage] [Code]
  • Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction. [29 March, 2024] [ArXiv, 2024]
    Qiuhong Shen, Xuanyu Yi, Zike Wu, Pan Zhou, Hanwang Zhang, Shuicheng Yan, Xinchao Wang.
    [Paper]
  • Matten: Video Generation with Mamba-Attention. [5 May, 2024] [ArXiv, 2024]
    Yu Gao, Jiancheng Huang, Xiaopeng Sun, Zequn Jie, Yujie Zhong, Lin Ma.
    [Paper]
  • SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion. [5 May, 2024] [ArXiv, 2024]
    Ziyun Qian, Zeyu Xiao, Zhenyi Wu, Dingkang Yang, Mingcheng Li, Shunli Wang, Shuaibing Wang, Dongliang Kou, Lihua Zhang.
    [Paper]
  • DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis. [23 May, 2024] [ArXiv, 2024]
    Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu.
    [Paper] [Code]
  • Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation. [24 May, 2024] [ArXiv, 2024]
    Shentong Mo, Yapeng Tian.
    [Paper]
  • Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent. [27 May, 2024] [ArXiv, 2024]
    Yi Xu, Yun Fu.
    [Paper] [Code]
  • DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention. [28 May, 2024] [ArXiv, 2024]
    Lianghui Zhu, Zilong Huang, Bencheng Liao, Jun Hao Liew, Hanshu Yan, Jiashi Feng, Xinggang Wang.
    [Paper] [Code]
  • Dimba: Transformer-Mamba Diffusion Models. [3 June, 2024] [ArXiv, 2024]
    Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Youqiang Zhang, Junshi Huang.
    [Paper] [Homepage] [Code]

Multi-Modal

1 Heterologous Stream

1.1 Multi-Modal Understanding

  • MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models. [14 March, 2024] [ArXiv, 2024]
    Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li.
    [Paper]
  • ReMamber: Referring Image Segmentation with Mamba Twister. [26 March, 2024] [ArXiv, 2024]
    Yuhuan Yang, Chaofan Ma, Jiangchao Yao, Zhun Zhong, Ya Zhang, Yanfeng Wang.
    [Paper]
  • SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding. [1 April, 2024] [ArXiv, 2024]
    Wenrui Li, Xiaopeng Hong, Xiaopeng Fan.
    [Paper]

1.2 Multimodal Large Language Models

  • VL-Mamba: Exploring State Space Models for Multimodal Learning. [20 March, 2024] [ArXiv, 2024]
    Yanyuan Qiao, Zheng Yu, Longteng Guo, Sihan Chen, Zijia Zhao, Mingzhen Sun, Qi Wu, Jing Liu.
    [Paper] [Homepage] [Code]
  • Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference. [22 March, 2024] [ArXiv, 2024]
    Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang.
    [Paper] [Homepage] [Code]
  • CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation. [30 April, 2024] [ArXiv, 2024]
    Weiquan Huang, Yifei Shen, Yifan Yang.
    [Paper] [Code]
  • Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models. [8 May, 2024] [ArXiv, 2024]
    Zhengxing Lan, Hongbo Li, Lingshan Liu, Bo Fan, Yisheng Lv, Yilong Ren, Zhiyong Cui.
    [Paper]
  • Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models. [27 May, 2024] [ArXiv, 2024]
    Byung-Kwan Lee, Chae Won Kim, Beomchan Park, Yong Man Ro.
    [Paper] [Code]
  • RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation. [6 June, 2024] [ArXiv, 2024]
    Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang.
    [Paper] [Homepage] [Code]

2 Homologous Stream

  • Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation. [5 April, 2024] [ArXiv, 2024]
    Zifu Wan, Yuhao Wang, Silong Yong, Pingping Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie.
    [Paper] [Code]
  • Fusion-Mamba for Cross-modality Object Detection. [14 April, 2024] [ArXiv, 2024]
    Wenhao Dong, Haodong Zhu, Shaohui Lin, Xiaoyan Luo, Yunhang Shen, Xuhui Liu, Juan Zhang, Guodong Guo, Baochang Zhang.
    [Paper]

Vertical Application

1 Remote Sensing Image

1.1 Remote Sensing Image Processing

  • Pan-Mamba: Effective pan-sharpening with State Space Model. [8 March, 2024] [ArXiv, 2024]
    Xuanhua He, Ke Cao, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou.
    [Paper] [Code]
  • HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising. [15 April, 2024] [ArXiv, 2024]
    Yang Liu, Jiahua Xiao, Yu Guo, Peilin Jiang, Haiwei Yang, Fei Wang.
    [Paper]
  • SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising. [15 May, 2024] [ArXiv, 2024]
    Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Yuntao Qian.
    [Paper] [Code]
  • Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution. [8 May, 2024] [ArXiv, 2024]
    Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin.
    [Paper]
  • RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing. [16 May, 2024] [ArXiv, 2024]
    Huiling Zhou, Xianhao Wu, Hongming Chen, Xiang Chen, Xin He.
    [Paper]
  • HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model. [9 June, 2024] [ArXiv, 2024]
    Hang Fu, Genyun Sun, Yinhe Li, Jinchang Ren, Aizhu Zhang, Cheng Jing, Pedram Ghamisi.
    [Paper] [Code]

1.2 Remote Sensing Image Classification

  • RSMamba: Remote Sensing Image Classification with State Space Model. [28 March, 2024] [ArXiv, 2024]
    Keyan Chen, Bowen Chen, Chenyang Liu, Wenyuan Li, Zhengxia Zou, Zhenwei Shi.
    [Paper]
  • SpectralMamba: Efficient Mamba for Hyperspectral Image Classification. [12 April, 2024] [ArXiv, 2024]
    Jing Yao, Danfeng Hong, Chenyu Li, Jocelyn Chanussot.
    [Paper] [Code]
  • Spectral-Spatial Mamba for Hyperspectral Image Classification. [29 April, 2024] [ArXiv, 2024]
    Lingbo Huang, Yushi Chen, Xin He.
    [Paper] [Code]
  • S2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification. [28 April, 2024] [ArXiv, 2024]
    Guanchun Wang, Xiangrong Zhang, Zelin Peng, Tianyang Zhang, Xiuping Jia, Licheng Jiao.
    [Paper] [Code]
  • Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification. [25 May, 2024] [ArXiv, 2024]
    Weilian Zhou, Sei-Ichiro Kamata, Haipeng Wang, Man-Sing Wong, Huiying Hou.
    [Paper] [Code]

1.3 Remote Sensing Image Change Detection

  • ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model. [14 April, 2024] [ArXiv, 2024]
    Hongruixuan Chen, Jian Song, Chengxi Han, Junshi Xia, Naoto Yokoya.
    [Paper] [Code]
  • RSCaMa: Remote Sensing Image Change Captioning with State Space Model. [2 May, 2024] [ArXiv, 2024]
    Chenyang Liu, Keyan Chen, Bowen Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi.
    [Paper] [Code]
  • CDMamba: Remote Sensing Image Change Detection with Mamba. [6 June, 2024] [ArXiv, 2024]
    Haotian Zhang, Keyan Chen, Chenyang Liu, Hao Chen, Zhengxia Zou, Zhenwei Shi.
    [Paper] [Code]

1.4 Remote Sensing Image Segmentation

  • Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model. [11 April, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen.
    [Paper] [Code]
  • RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation. [3 April, 2024] [ArXiv, 2024]
    Xianping Ma, Xiaokang Zhang, Man-On Pun.
    [Paper] [Code]
  • RS-Mamba for Large Remote Sensing Image Dense Prediction. [10 April, 2024] [ArXiv, 2024]
    Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, Wanli Ouyang.
    [Paper] [Code]
  • Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study. [14 May, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuan Fang, Yuanzhi Cai, Cheng Chen, Lei Fan.
    [Paper]
  • CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation. [17 May, 2024] [ArXiv, 2024]
    Mushui Liu, Jun Dan, Ziqian Lu, Yunlong Yu, Yingming Li, Xi Li.
    [Paper] [Code]
  • PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery. [16 June, 2024] [ArXiv, 2024]
    Libo Wang, Dongxu Li, Sijun Dong, Xiaoliang Meng, Xiaokang Zhang, Danfeng Hong.
    [Paper] [Code]
  • Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images. [20 June, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuanzhi Cai, Lei Fan.
    [Paper] [Code]

1.5 Remote Sensing Image Fusion

  • FusionMamba: Efficient Image Fusion with State Space Model. [11 April, 2024] [ArXiv, 2024]
    Siran Peng, Xiangyu Zhu, Haoyu Deng, Zhen Lei, Liang-Jian Deng.
    [Paper]
  • A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion. [14 April, 2024] [ArXiv, 2024]
    Zihan Cao, Xiao Wu, Liang-Jian Deng, Yu Zhong.
    [Paper]

2 Medical Image

2.1 Medical Image Segmentation

2.1.1 Preliminary Explorations of U-shaped Mamba
  • U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation. [9 January, 2024] [ArXiv, 2024]
    Jun Ma, Feifei Li, Bo Wang.
    [Paper] [Homepage] [Code]
  • VM-UNet: Vision Mamba UNet for Medical Image Segmentation. [4 February, 2024] [ArXiv, 2024]
    Jiacheng Ruan, Suncheng Xiang.
    [Paper] [Code]
  • Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation. [30 March, 2024] [ArXiv, 2024]
    Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei Li.
    [Paper] [Code]
  • Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining. [6 March, 2024] [ArXiv, 2024]
    Jiarun Liu, Hao Yang, Hong-Yu Zhou, Yan Xi, Lequan Yu, Yizhou Yu, Yong Liang, Guangming Shi, Shaoting Zhang, Hairong Zheng, Shanshan Wang.
    [Paper] [Code]
2.1.2 Improvements to the U-shaped Mamba
  • LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation. [11 March, 2024] [ArXiv, 2024]
    Weibin Liao, Yinghao Zhu, Xinyuan Wang, Chengwei Pan, Yasha Wang, Liantao Ma.
    [Paper] [Code]
  • VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation. [14 March, 2024] [ArXiv, 2024]
    Mingya Zhang, Yue Yu, Limei Gu, Tingsheng Lin, Xianping Tao.
    [Paper] [Code]
  • Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention. [12 March, 2024] [ArXiv, 2024]
    Jinhong Wang, Jintai Chen, Danny Chen, Jian Wu.
    [Paper] [Code]
  • H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation. [20 March, 2024] [ArXiv, 2024]
    Renkai Wu, Yinghao Liu, Pengchen Liang, Qing Chang.
    [Paper] [Code]
  • Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion. [26 March, 2024] [ArXiv, 2024]
    Kazi Shahriar Sanjid, Md. Tanzim Hossain, Md. Shakib Shahariar Junayed, Dr. Mohammad Monir Uddin.
    [Paper]
  • Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation. [16 April, 2024] [ArXiv, 2024]
    Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu.
    [Paper]
  • UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation. [24 April, 2024] [ArXiv, 2024]
    Renkai Wu, Yinghao Liu, Pengchen Liang, Qing Chang.
    [Paper] [Code]
  • AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation. [5 May, 2024] [ArXiv, 2024]
    Xiaoyan Lei, Wenlong Zhang, Weifeng Cao.
    [Paper] [Code]
  • MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation. [24 May, 2024] [ArXiv, 2024]
    Chunyu Yuan, Dongfang Zhao, Sos S. Agaian.
    [Paper] [Code]
  • Convolution and Attention-Free Mamba-based Cardiac Image Segmentation. [9 June, 2024] [ArXiv, 2024]
    Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh.
    [Paper]
  • MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba. [9 June, 2024] [ArXiv, 2024]
    Zhongping Ji.
    [Paper] [Code]
2.1.3 U-shaped Mamba with Other Methodologies
  • Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation. [29 March, 2024] [ArXiv, 2024]
    Chao Ma, Ziyang Wang.
    [Paper] [Code]
  • Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation. [16 February, 2024] [ArXiv, 2024]
    Ziyang Wang, Chao Ma.
    [Paper] [Code]
  • ProMamba: Prompt-Mamba for polyp segmentation. [26 March, 2024] [ArXiv, 2024]
    Jianhao Xie, Ruofan Liao, Ziang Zhang, Sida Yi, Yuesheng Zhu, Guibo Luo.
    [Paper]
  • P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation. [15 March, 2024] [ArXiv, 2024]
    Zi Ye, Tianxiang Chen, Fangyijie Wang, Hanwei Zhang, Guanxi Li, Lijun Zhang.
    [Paper]
2.1.4 Multi-Dimensional Medical Data Segmentation
  • SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation. [25 February, 2024] [ArXiv, 2024]
    Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu.
    [Paper] [Code]
  • nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model. [10 March, 2024] [ArXiv, 2024]
    Haifan Gong, Luoyao Kang, Yitao Wang, Xiang Wan, Haofeng Li.
    [Paper] [Code]
  • T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation. [1 April, 2024] [ArXiv, 2024]
    Jing Hao, Lei He, Kuo Feng Hung.
    [Paper] [Code]
  • Vivim: a Video Vision Mamba for Medical Video Object Segmentation. [12 March, 2024] [ArXiv, 2024]
    Yijun Yang, Zhaohu Xing, Chunwang Huang, Lei Zhu.
    [Paper] [Code]

2.2 Pathological Diagnosis

  • MedMamba: Vision Mamba for Medical Image Classification. [2 April, 2024] [ArXiv, 2024]
    Yubiao Yue, Zhenzhang Li.
    [Paper] [Code]
  • MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models. [8 March, 2024] [ArXiv, 2024]
    Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang.
    [Paper]
  • MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology. [11 March, 2024] [ArXiv, 2024]
    Shu Yang, Yihui Wang, Hao Chen.
    [Paper] [Code]
  • CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification. [25 March, 2024] [ArXiv, 2024]
    Guangqian Yang, Kangrui Du, Zhihan Yang, Ye Du, Yongping Zheng, Shujun Wang.
    [Paper]
  • SurvMamba: State Space Model with Multi-grained Multi-modal Interaction for Survival Prediction. [11 April, 2024] [ArXiv, 2024]
    Ying Chen, Jiajing Xie, Yuxiang Lin, Yuhang Song, Wenxian Yang, Rongshan Yu.
    [Paper]
  • Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba. [28 May, 2024] [ArXiv, 2024]
    Zefan Yang, Jiajin Zhang, Ge Wang, Mannudeep K. Kalra, Pingkun Yan.
    [Paper]
  • MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging. [2 June, 2024] [ArXiv, 2024]
    Jiaying Zhou, Mingzhou Jiang, Junde Wu, Jiayuan Zhu, Ziyue Wang, Yueming Jin.
    [Paper]
  • Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans. [9 June, 2024] [ArXiv, 2024]
    Muthukumar K A, Amit Gurung, Priya Ranjan.
    [Paper]

2.3 Deformable Image Registration

  • MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration. [12 March, 2024] [ArXiv, 2024]
    Tao Guo, Yinuo Wang, Shihao Shu, Diansheng Chen, Zhouping Tang, Cai Meng, Xiangzhi Bai.
    [Paper] [Code]
  • VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration. [7 April, 2024] [ArXiv, 2024]
    Ziyang Wang, Jian-Qing Zheng, Chao Ma, Tao Guo.
    [Paper] [Code]

2.4 Medical Image Reconstruction

  • FD-Vision Mamba for Endoscopic Exposure Correction. [14 February, 2024] [ArXiv, 2024]
    Zhuoran Zheng, Jun Zhang.
    [Paper] [Code]
  • MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation. [19 March, 2024] [ArXiv, 2024]
    Jiahao Huang, Liutao Yang, Fanwen Wang, Yinzhe Wu, Yang Nan, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang.
    [Paper] [Code]
  • FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba. [20 April, 2024] [ArXiv, 2024]
    Xinyu Xie, Yawen Cui, Chio-In Ieong, Tao Tan, Xiaozhi Zhang, Xubin Zheng, Zitong Yu.
    [Paper] [Code]
  • MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion. [12 April, 2024] [ArXiv, 2024]
    Zhe Li, Haiwei Pan, Kejia Zhang, Yuhua Wang, Fengming Yu.
    [Paper]
  • Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba. [27 May, 2024] [ArXiv, 2024]
    Jiahao Huang, Liutao Yang, Fanwen Wang, Yinzhe Wu, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang.
    [Paper]
  • MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion. [27 June, 2024] [ArXiv, 2024]
    Jing Zou, Lanqing Liu, Qi Chen, Shujun Wang, Xiaohan Xing, Jing Qin.
    [Paper]

2.5 Other Medical Tasks

  • MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction. [13 March, 2024] [ArXiv, 2024]
    Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yali Shen, Yu Yao.
    [Paper] [Code]
  • Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy. [20 April, 2024] [ArXiv, 2024]
    Yuelin Zhang, Wanquan Yan, Kim Yan, Chun Ping Lam, Yufu Qiu, Pengyu Zheng, Raymond Shing-Yan Tang, Shing Shin Cheng.
    [Paper] [Code]
  • VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis. [9 May, 2024] [ArXiv, 2024]
    Zhihan Ju, Wanting Zhou.
    [Paper]
  • I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling. [22 May, 2024] [ArXiv, 2024]
    Omer F. Atli, Bilal Kabas, Fuat Arslan, Mahmut Yurt, Onat Dalmaz, Tolga Çukur.
    [Paper] [Code]
  • On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models. [12 June, 2024] [ArXiv, 2024]
    Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan.
    [Paper] [Code]
  • Soft Masked Mamba Diffusion Model for CT to MRI Conversion. [22 June, 2024] [ArXiv, 2024]
    Zhenbin Wang, Lei Zhang, Lituan Wang, Zhenwei Zhang.
    [Paper] [Code]

Other Domains

Coming soon.