DHD: A Python repository from lvchuandong

Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction

Yuan Wu^1*, Zhiqiang Yan^1*†, Zhengxue Wang¹, Xiang Li², Le Hui³, Jian Yang^1†

^*equal contribution ^†corresponding author
¹Nanjing University of Science and Technology ²Nankai University ³Northwestern Polytechnical University

[Paper] [Project Page]

Method

DHD comprises a feature extractor, HeightNet, DepthNet, MGHS, SFA, and predictor. The feature extractor first acquires 2D image feature. Then, DepthNet extracts context feature and depth prediction. HeightNet generates the height map to determine the height value at each pixel. Next, MGHS integrates the output of HeightNet and DepthNet, acquiring height-refined feature and depth-based feature. Finally, the dual features are fed into the SFA to obtain the aggregated feature, which serves as input for the predictor.

Get Started

Installation and Data Preparation

Step1、Prepare environment as that in Install.

Step2、Prepare nuScene and generate pkl file by runing：

python tools/create_data_bevdet.py

The finnal directory structure for 'data' folder is like

└── data
  └── nuscenes
      ├── v1.0-trainval 
      ├── sweeps  
      ├── samples
      ├── gts
      ├── bevdetv2-nuscenes_infos_train.pkl 
      └── bevdetv2-nuscenes_infos_val.pkl

Train & Test

# train:
tools/dist_train.sh ${config} ${num_gpu}
# train DHD-S:
tools/dist_train.sh projects/configs/DHD/DHD-S.py 4

# test:
tools/dist_test.sh ${config} ${ckpt} ${num_gpu} --eval mAP
# test DHD-S:
tools/dist_test.sh projects/configs/DHD/DHD-S.py model_weight/DHD-S.pth 4 --eval mAP

lvchuandong/DHD

Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction

Method

Get Started

Installation and Data Preparation

Train & Test

Model weights

Experiment

Quantitative comparison

Visual comparison

Acknowledgements

Citation