/FER-multifrequency-so3

(ICLR2024) An official repo for the paper "An Intuitive Multi-Frequency Feature Representation For SO(3)-Equivariant Networks".

Primary LanguagePython

An Intuitive Multi-Frequency Feature Representation For SO(3)-Equivariant Networks (ICLR 2024 Poster)

Created by Dongwon Son, Jaehyung Kim, Sanghyeon Son, and Beomjoon Kim.

The usage of 3D vision algorithms, such as shape reconstruction, remains limited because they require inputs to be at a fixed canonical rotation. Recently, a simple equivariant network, Vector Neuron (VN) (Deng et al., 2021) has been proposed that can be easily used with the state-of-the-art 3D neural network (NN) architectures. However, its performance is limited because it is designed to use only three-dimensional features, which is insufficient to capture the details present in 3D data. In this paper, we introduce an equivariant feature representation for mapping a 3D point to a high-dimensional feature space. Our feature can discern multiple frequencies present in 3D data, which, as shown by Tancik et al. (2020), is the key to designing an expressive feature for 3D vision tasks. Our representation can be used as an input to VNs, and the results demonstrate that with our feature representation, VN captures more details, overcoming the limitation raised in its original paper.

[Project] | [Paper]

Setup environment

For classification, part segmentation, and occupancy networks, please set up a conda environment using...

conda env create --file environment.yaml

For GraphONet please refer to the directory. Please check out each directory for further instructions.

BibTex

@inproceedings{
son2024an,
title={An Intuitive Multi-Frequency Feature Representation for {SO}(3)-Equivariant Networks},
author={Dongwon Son and Jaehyung Kim and Sanghyeon Son and Beomjoon Kim},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=5JWAOLBxwp}
}