/MP2Net

The official code of our TGRS'24 paper MP2Net: Mask Propagation and Motion Prediction Network for Multi-Object Tracking in Satellite Videos

Primary LanguagePythonMIT LicenseMIT

MP^2^Net

The official PyTorch implementation of our TGRS 2024 paper:

MP2Net: Mask Propagation and Motion Prediction Network for Multi-Object Tracking in Satellite Videos

Highlight

Brief Introduction

We propose a novel joint-detection-and-tracking framework, MP^2^Net, for multi-object tracking in satellite videos. MP^2^Net utilizes mask propagation and matching mechanism assist tiny target and local region detection and introduces implicit and explicit motion prediction strategies to facilitate the interaction of detection and tracking. Experimental results on two large-scale datasets demonstrate the effectiveness and robustness of MP^2^Net, achieving state-of-the-art performance on typical moving objects in satellite videos, such as 66.9% MOTA and 76.0% IDF1 on the SatVideoDT challenge dataset.

For more detailed information, please refer to the paper.

Overall Performance1

Variant Dataset Category MOTA IDF1 IDP IDR Rcll Prcn IDs FP FN MT ML
Ours_DLADCN SatVideoDT car 66.9% 76.0% 88.5% 66.6% 71.2% 94.5% 520 11536 80961 805 159
Ours_ResFPN SatVideoDT car 64.7% 73.6% 88.8% 62.9% 67.9% 95.7% 512 8459 90183 759 184
Ours_DLADCN SatMTB car 60.3% 71.8% 88.0% 60.6% 64.7% 94.0% 986 14090 121319 1468 647
airplane 72.4% 83.3% 90.3% 77.3% 79.0% 92.3% 11 1028 3275 51 7
ship 46.6% 64.5% 74.0% 57.2% 62.2% 80.5% 98 3281 8227 45 18

Release

Trained Models [model weights] and Raw Results [raw results] for both ICPR SatVideoDT dataset and SatMTB dataset.

Getting Started

Environment

Our experiments are conducted with python3.6, pytorch1.7.0, and CUDA 10.2.

You can follow CenterNet to build the conda environment and use the latested version of DCNv2 under PyTorch 1.x.

Install evaluation requirements using pip install motmetrics

Data Preparation

The ICPR SatVideoDT dataset used here is available in https://satvideodt.github.io. (An extensive version of VISO dataset, with more data and corrected annotations)

The SatMTB dataset is available in BaiduYun with password s5y7, and the unzip password is CSU@2023.

Our collected json file is available in json

-dataset
	|-ICPR
		|-train_data
			|-001
			|-002
			...
		|-val_data
			|-001
			|-002
			...
		|-annotations
			|-instances_train_caronly.json
			|-instances_val_caronly.json
	
	|-SatMTB
		|-train
			|-img
				|-airplane
					|xx
					...
				|-car
				|-ship
				|-train
			|-label
				|-airplane
					|xx.txt
					...
				|-car
				|-ship
				|-train
		|-test
			...(same with train)
		|-instances_SatMTB_train.json
		|-instances_SatMTB_test.json

Training

Training for SatVideoDT.

python train.py --model_name DLADCN --gpus 0,1 --lr 1.25e-4 --lr_step 14 --num_epochs 15 --batch_size 4 --seqLen 5 --datasetname ICPR --data_dir  ./data/ICPR/

Training for SatMTB.

python train_satmtb.py --model_name DLADCN --gpus 0,1 --lr 1.25e-4 --lr_step 19 --num_epochs 20 --batch_size 4 --seqLen 5 --num_classes 3 --datasetname SatMTB --data_dir ./data/SatMTB/

Testing

Testing for SatVideoDT.

python testDis.py --model_name DLADCN --gpus 0 --load_model ./checkpoints/MP2Net_ICPR_DLADCN.pth --datasetname ICPR --data_dir  ./data/ICPR/ 

Testing for SatMTB.

python testDis_satmtb.py --model_name DLADCN --gpus 0 --load_model ./checkpoints/MP2Net_SatMTB_DLADCN.pth --datasetname SatMTB --data_dir ./data/SatMTB/ --num_classes 3

(Optional) Testing with SORT.

python testTrackingSort.py --model_name DLADCN --gpus 0 --load_model ./checkpoints/MP2Net_ICPR_DLADCN.pth --datasetname ICPR --data_dir  ./data/ICPR/

Evaluation

Evaluation for SatVideoDT.

python eval.py

Evaluation for SatMTB.

python eval_satmtb.py

Acknowledgement

Our idea is built upon the following projects. We really appreciate their excellent open-source works!

Citation

If any parts of our paper and code help your research, please consider citing us and giving a star to our repository.

@article{zhao2024mp2net,
  author={Zhao, Manqi and Li, Shengyang and Wang, Han and Yang, Jian and Sun, Yuhan and Gu, Yanfeng},
  journal={IEEE Transactions on Geoscience and Remote Sensing}, 
  title={MP2Net: Mask Propagation and Motion Prediction Network for Multiobject Tracking in Satellite Videos}, 
  year={2024},
  volume={62},
  pages={1-15},
  publisher={IEEE}
}

Contact

If you have any questions or concerns, feel free to open issues or contact me through email [zhaomanqi19@csu.ac.cn].

Footnotes

  1. Note that the results presented here differ slightly from those in the original article since we have made some optimizations to enhance our model's performance on cars. The variations in the results from the SatMTB dataset arise from using different training and test splits. We re-trained and tested our model using the publicly released version, and the results are summarized in the table. Rest assured, these new results do not affect the conclusions drawn in the original article.