Created by Huan Lei, Naveed Akhtar and Ajmal Mian
This work is a significant extension of our original work presented in IEEE CVPR2019, and is accepted to TPAMI in March 2020.
We propose a spherical kernel for efficient graph convolution of 3D point clouds. Our metric-based kernels systematically quantize the local 3D space to identify distinctive geometric relationships in the data. Similar to the regular grid CNN kernels, the spherical kernel maintains translation-invariance and asymmetry properties, where the former guarantees weight sharing among similar local structures in the data and the latter facilitates fine geometric learning. The proposed kernel is applied to graph neural networks without edge-dependent filter generation, making it computationally attractive for large point clouds. In our graph networks, each vertex is associated with a single point location and edges connect the neighborhood points within a defined range. The graph gets coarsened in the network with farthest point sampling. Analogous to the standard CNNs, we define pooling and unpooling operations for our network. We demonstrate the effectiveness of the proposed spherical kernel with graph neural networks for point cloud classification and semantic segmentation using ModelNet, ShapeNet, RueMonge2014, ScanNet and S3DIS datasets.
In this repository, we release the code and trained models for classification and segmentation.
If you find our work useful in your research, please consider citing:
@article{lei2020spherical,
title={Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds},
author={Lei, Huan and Akhtar, Naveed and Mian, Ajmal},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
year={2020}
}
@article{lei2019octree,
title={Octree guided CNN with Spherical Kernels for 3D Point Clouds},
author={Lei, Huan and Akhtar, Naveed and Mian, Ajmal},
journal={IEEE Conference on Computer Vision and Pattern Recognition},
year={2019}
}
Our code is released under MIT License (see LICENSE file for details).
Install Tensorflow. The code was tested with Python 3.5, Tensorflow 1.12.0, Cuda 9.0 and Cudnn 7.1.4 on Ubuntu 16.04. The used GPU is NVIDIA Titan XP.
**Note: while implementing the new tensorlfow operators, we assumed that the GPU supports a block of 1024 threads.
Please compile the cuda-based operations in tf-ops folder using the command
(sudo) ./compile.sh
You may need to install Matlab. It is required to preprocess the datasets, such as the grid-based downsampling.
We preprocess each segmentation dataset using the corresponding function under the folder preprocessing:
preprocessing/shapenet_removeSingularPoints.m
preprocessing/ruemonge2014_prepare_data.m.m
preprocessing/scannet_prepare_data.m
preprocessing/s3dis_prepare_data.m
And then transform the *.txt files to tfrecord format for fast data feeding in Tensorflow:
cd io
python make_tfrecord_modelnet.py
python make_tfrecord_shapenet.py
python make_tfrecord_ruemonge2014.py
python make_tfrecord_scannet.py
python make_tfrecord_s3dis.py
python make_tfrecord_s3dis_no_split.py
All of the trained models and our results on ShapeNet and S3DIS can be downloaded from this link.
-
ModelNet
- To train a model to classify the 40 object classes:
cd modelnet40_cls python train_modelnet.py
- To test the classification results:
python evaluate_modelnet.py --num_votes=12
- To train a model to classify the 40 object classes:
-
ShapeNet
- To train a model to segment parts of the Table Category:
cd shapenet_seg python train_shapenet.py --shape_name=Table
- To test the segmentation performance of the trained model:
python evaluate_modelnet.py --shape_name=Table --model_name=xxxx
- To train a model to segment parts of the Table Category:
-
RueMonge2014
- train
cd ruemonge2014_seg python train_ruemonge2014.py
- test
python evaluate_ruemonge2014.py --model_name=xxxx
- train
-
ScanNet V2
Download the ScanNet dataset.- train
cd scannet_seg python train_scannet.py
- test
python evaluate_scannet_with_overlap.py --model_name=xxxx python scannet_block2index_with_overlap.py
- train
-
S3DIS
Download the S3DIS dataset.- train
cd s3dis_seg python train_s3dis.py
- test
python evaluate_s3dis_with_overlap.py --model_name=xxxx python s3dis_block2index_with_overlap.py
- train
The datasets are trained and tested with split blocks. We merge them back into complete scenes using functions under the folder post-merging in Matlab.
post-merging/scannet_merge.m
post-merging/s3dis_merge.m