VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

Official Implementation of "VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection" [CVPR 2024]

Installation

Setup the conda environment.

conda env create -f environment.yml

Install this repository.

pip install -e .

Data Preparation

Download the KITTI-360 dataset.
Transform the 3D bounding boxes from the world coordinate system to the camera coordinate system.

python tools/datasets/kitti_360/make_annotations.py

For each target view, sample source frames for multi-view 3D auto-labeling.

python tools/datasets/kitti_360/sample_annotations.py

Multi-View 3D Auto-Labeling

Our multi-view 3D auto labeling optimizes the 3D bounding boxes and residual signed distance fields (RDF) for each scene. Note that the data is split and distributed to each process, but unlike general distributed training, the gradients are not averaged between processes.

Slurm Workload Manager

python -m vsrd.distributed.slurm.launch \
    --partition PARTITION \
    --num_nodes NUM_NODES \
    --num_gpus NUM_GPUS \
    scripts/main.py \
        --launcher slurm \
        --config CONFIG \
        --train

Torch Distributed Launcher

torchrun \
    --rdzv_backend c10d \
    --rdzv_endpoint HOST_NODE_ADDR \
    --nnodes NUM_NODES \
    --nproc_per_node NUM_GPUS \
    scripts/main.py \
        --launcher torch \
        --config CONFIG \
        --train

dstarer/VSRD

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

Installation

Data Preparation

Multi-View 3D Auto-Labeling