/DS-Net

[CVPR 2021] Rank 1st in the public leaderboard of SemanticKITTI Panoptic Segmentation (2020-11-16)

Primary LanguagePythonMIT LicenseMIT

PWC

LiDAR-based Panoptic Segmentation via Dynamic Shifting Network

teaser

This repository provides the official implementation for the following two papers:

LiDAR-based Panoptic Segmentation via Dynamic Shifting Network
Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu
Accepted to CVPR 2021
arXiv | CVF Open Access

LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network
Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu
arXiv Preprint, 2022
arXiv

For further information, please contact Fangzhou Hong.

News

Requirements

Data Preparation

Please download the SemanticKITTI dataset to the folder data and the structure of the folder should look like:

./
├── 
├── ...
└── data/
    ├──sequences
        ├── 00/           
        │   ├── velodyne/	
        |   |	├── 000000.bin
        |   |	├── 000001.bin
        |   |	└── ...
        │   └── labels/ 
        |       ├── 000000.label
        |       ├── 000001.label
        |       └── ...
        ├── 08/ # for validation
        ├── 11/ # 11-21 for testing
        └── 21/
	        └── ...

Getting Started

The training pipeline of our DS-Net consists of three steps: 1) semantic segmentation training; 2) center regression training; 3) dynamic shifting training. The first two steps give us the backbone model. The last step gives our DS-Net. We provide the corresponding pretrained model of each step. The inferencing and training details are further explained in this section.

Note that our implementation only supports parallel training for now. We fix the batch size of each GPUs to 1. In the first line of each script, you could choose the number of GPUs ${ngpu} you wish to use for training or inferencing. In the second line, you could set the folder name ${tag} and all the generated files will be put into ./output/${tag}. All the provided pytorch distributed version of scripts are not tested due to the lack of proper environment. All the slurm version of scripts are tested and should work well. Should there be any problem, feel free to open an issue.

Pretrained Models

DS-Net

If you wish to use our pretrained models, remember to create a new folder pretrained_weight and put all the downloaded models there.

Step Download Link
1 sem_pretrain.pth
2 offset_pretrain_pq_0.564.pth
3 dsnet_pretrain_pq_0.577.pth

4D-DS-Net

This is the trained final model. Download Link: checkpoint_epoch_5_0.640_0.594_0.648.pth.

Inferencing with the Pretrained Models

We provide inferencing scripts for the backbone and our DS-Net.

Backbone

Our backbone consists of the semantic segmentation module, the center regression module, a heuristic clustering algorithm and the consensus-driven fusion module. You are welcomed to play around with different heuristic algorithms and their parameter settings in ./cfgs/release/backbone.yaml since we provide several clustering algorithms in ./utils/clustering.py.

The inferencing scripts of our backbone are ./scripts/release/backbone/val_*.sh. Before using the scripts, please make sure you have downloaded the pretrained model (of step 2) or put the models trained by yourself (in step 2) to ./pretrained_weight and make sure to pass the correct path of the model to --pretrained_ckpt option.

DS-Net

The inferencing scripts of our DS-Net are in ./scripts/release/dsnet. val_*.sh are for inferencing on the validation set of SemanticKITTI. test_*.sh are for inferencing on the test set of SemanticKITTI and will generate prediction files under the corresponding output folder. Before using the scripts, remember to download the pretrained model (of step 3) or put the model trained by yourself (in step 3) to ./pretrained_weight and make sure you pass the right path to --pretrained_ckpt option in the scripts.

DS-Net: Training from Scratch

1. Semantic segmentation training

The training codes and scripts for this step will be released soon. For now, please download the step 1 pretrained model using the above link. Please note that the cylinder backbone used in our implementation is the original version of Cylinder3D instead of the latest version.

2. Center regression training

The training scripts of this step could be found in ./scripts/release/backbone/train_*.sh. Before using the training scripts, please download the pretrained model of step 1 to folder ./pretrained_weight. Feel free to play around with different parameter settings in ./cfgs/release/backbone.yaml.

3. Dynamic shifting training

The training scripts of step 3 could be found in ./scripts/release/dsnet/train_*.sh. Before using the training scripts of this part, please download the pretrained model (of step 2) to folder ./pretrained_weight or put the model trained (in step 2) to ./pretrained_weight and change the --pretrained_ckpt option to the correct path. You could experiment with different parameter settings in ./cfgs/release/dsnet.yaml.

4D-DS-Net: Training from Scratch

For the trained models, checkout here.

1. Backbone finetune

Please checkout ./scripts/release/4d-dsnet/train_backbone_multi_frames_2.sh.

2. 4D dynamic shifting training

Please checkout ./scripts/release/4d-dsnet/train_dsnet_multi_frames_2.sh.

License

Distributed under the MIT License. See LICENSE for more information.

Citation

If you find our work useful in your research, please consider citing the following papers:

@InProceedings{Hong_2021_CVPR,
    author    = {Hong, Fangzhou and Zhou, Hui and Zhu, Xinge and Li, Hongsheng and Liu, Ziwei},
    title     = {LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {13090-13099}
}

@article{hong20224ddsnet,
  title={LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network},
  author={Hong, Fangzhou and Zhou, Hui and Zhu, Xinge and Li, Hongsheng and Liu, Ziwei},
  journal={arXiv preprint arXiv:2203.07186},
  year={2022}
}

Acknowledgments

In our implementation, we refer to the following open-source databases: