Instructor: Minhyuk Sung (mhsung [at] kaist.ac.kr)
TA: Juil Koo (63days [at] kaist.ac.kr)

This project aims to build a 3D diffusion model, specifically targeting 3D volume diffusion. We train the model using voxel data at a resolution of (128, 128, 128) from the ShapeNet chair class. A major challenge will be efficiently handling this high-resolution data within limited VRAM constraints.
The dataset consists of 2,658 chairs represented by binary voxels, where a value of 1 indicates the object's surface at that position. To obtain the voxel data, we voxelize point clouds from the ShapeNet dataset. Run the following command to preprocess the data:
python load_data.py
A 3D voxel visualization code is in visualize.ipynb
.
Your task is to implement a diffusion model that generates 3D voxels. You have the freedom to explore any methods or techniques to handle the hih-resolution data efficiently. After implementing the model, run the evaluaiton code provided and report the results. Below are further details on the evaluation.
Sample 1,000 voxels using your model and save them in .npy
format with a shape of (1000, 128, 128, 128)
. After saving the data, load the samples and run the following command to perform the quantitative evaluation:
python run_evaluation.py {PATH/TO/YOUR_SAMPLE_DATA.NPY}
The dataset is from ShapeNet.