Graph-NAS-Materials

Neural Architecture Search for GNN Potentials with Reinforcement Learning

Omar Jiménez, Avi Arora, Sasan Zohreh

CSE8803 – Machine Learning for Chemistry

Hypothesis: The performance of graph neural networks in chemistry can be improved with NAS-based techniques

Goal: Reinforcement Learning-based NAS for materials datasets

NAS Formulation

(Figure: overview diagram of the RL-based NAS formulation.)
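At a high level, a controller samples one choice per decision point in the search space, a child GNN built from those choices is trained briefly, and its validation error on energies becomes the reward used to update the controller. The snippet below is a minimal, illustrative sketch of such a loop using plain REINFORCE with a moving-average baseline; `train_and_evaluate`, the optimizer, and all hyperparameter values are placeholders, and the controller objective actually used in the project is more exploratory than plain REINFORCE (see Future Work).

```python
# Minimal sketch of a REINFORCE-style controller for sequential NAS.
# NOTE: train_and_evaluate is a placeholder for building a child GNN,
# training it for a few epochs on energies, and returning validation MAE.
import random
import torch
import torch.nn as nn

# Discrete decision points the controller chooses from (a subset of the search space).
SEARCH_SPACE = {
    "gc_layer":   ["CGConv", "GATConv", "GraphConv", "GCNConv", "Dense"],
    "aggregate":  ["add", "mean", "max"],
    "activation": ["relu", "sigmoid", "tanh", "leaky_relu", "silu"],
    "pooling":    ["global_add", "global_mean", "global_max"],
    "norm":       ["BatchNorm", "LayerNorm", "MessageNorm", "GraphSizeNorm"],
}

class Controller(nn.Module):
    """One independent categorical distribution per decision point."""
    def __init__(self, space):
        super().__init__()
        self.space = space
        self.logits = nn.ParameterDict(
            {k: nn.Parameter(torch.zeros(len(v))) for k, v in space.items()})

    def sample(self):
        choices, log_prob = {}, 0.0
        for key, options in self.space.items():
            dist = torch.distributions.Categorical(logits=self.logits[key])
            idx = dist.sample()
            log_prob = log_prob + dist.log_prob(idx)
            choices[key] = options[idx.item()]
        return choices, log_prob

def train_and_evaluate(arch):
    # Placeholder: build the child GNN described by `arch`, train it briefly
    # on the materials dataset (energy targets only), return validation MAE.
    return random.uniform(0.05, 0.5)

controller = Controller(SEARCH_SPACE)
optimizer = torch.optim.Adam(controller.parameters(), lr=1e-2)
baseline = None  # moving-average baseline to reduce gradient variance

for step in range(100):
    arch, log_prob = controller.sample()
    reward = -train_and_evaluate(arch)      # lower validation MAE -> higher reward
    baseline = reward if baseline is None else 0.9 * baseline + 0.1 * reward
    loss = -(reward - baseline) * log_prob  # REINFORCE with baseline
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```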

Search Space

  • GC Layers: CGConv, GATConv, GraphConv, GCNConv, Dense (we also tried other layers such as Transformer/GPS, but those are difficult to train on a single GPU)
  • Aggregation Operators: Add, Mean, Max
  • Activation Functions: ReLU, Sigmoid, Tanh, LeakyReLU, SiLU
  • Pooling Layers: Global add/mean/max
  • Normalization Layers: BatchNorm, LayerNorm, MessageNorm, GraphSizeNorm
  • Hyperparameters: learning rate, batch size, GC/Dense layer dimensions (a sketch of building a child GNN from one sampled configuration follows this list)
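To make the search space concrete, the sketch below turns one sampled configuration into a child GNN with PyTorch Geometric. The layer width, depth, and the choice to predict a single graph-level energy are illustrative assumptions; normalization layers, edge features, and the Dense option (a node-wise linear layer) are omitted for brevity, and the notebooks may wire these pieces differently.

```python
# Hedged sketch: building a child GNN from one sampled configuration.
import torch.nn as nn
from torch_geometric.nn import (CGConv, GATConv, GCNConv, GraphConv,
                                global_add_pool, global_max_pool,
                                global_mean_pool)

ACTIVATIONS = {"relu": nn.ReLU, "sigmoid": nn.Sigmoid, "tanh": nn.Tanh,
               "leaky_relu": nn.LeakyReLU, "silu": nn.SiLU}
POOLING = {"global_add": global_add_pool, "global_mean": global_mean_pool,
           "global_max": global_max_pool}

def make_conv(name, dim, aggr):
    """Instantiate one graph-convolution layer from its sampled name."""
    if name == "CGConv":
        return CGConv(dim, aggr=aggr)      # keeps the node dimension unchanged
    if name == "GATConv":
        return GATConv(dim, dim)           # attention defines its own aggregation
    if name == "GraphConv":
        return GraphConv(dim, dim, aggr=aggr)
    if name == "GCNConv":
        return GCNConv(dim, dim)           # degree-normalized sum aggregation
    raise ValueError(f"unsupported layer: {name}")

class ChildGNN(nn.Module):
    def __init__(self, arch, node_dim=64, num_layers=3):
        super().__init__()
        self.convs = nn.ModuleList(
            [make_conv(arch["gc_layer"], node_dim, arch["aggregate"])
             for _ in range(num_layers)])
        self.act = ACTIVATIONS[arch["activation"]]()
        self.pool = POOLING[arch["pooling"]]
        self.head = nn.Linear(node_dim, 1)  # graph-level energy prediction

    def forward(self, x, edge_index, batch):
        for conv in self.convs:
            x = self.act(conv(x, edge_index))
        return self.head(self.pool(x, batch)).squeeze(-1)

# Example: build a child model from one sampled configuration.
arch = {"gc_layer": "GraphConv", "aggregate": "mean",
        "activation": "silu", "pooling": "global_mean"}
model = ChildGNN(arch)
```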

Future Work

  • Enhance the flexibility of the design space (e.g., greater layer diversity). Ideally, given a longer training budget, the controller could learn to build architectures from scratch, which is a typical approach in RL-based NAS
  • Parameter-sharing RL NAS: the original plan because of its greater efficiency, but it is trickier to implement, so it was safer to start with the more intuitive sequential approach
  • Smaller datasets (e.g., JARVIS-DFT), which could allow training each child GNN to convergence under the sequential NAS formulation
  • Training and evaluation of the best NAS architecture and the baselines on the full MP dataset
  • Less exploratory controller objectives when training for shorter times (e.g., a simple REINFORCE loss)
  • Molecule datasets, since only evolutionary algorithms have been applied to them so far
  • Force training for child GNNs (only energies were used in our experiments); a sketch of how forces could be obtained via autograd follows this list
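For the last point, a common way to add force supervision to a GNN potential is to predict the total energy and obtain forces as negative gradients of that energy with respect to atomic positions via autograd. The sketch below illustrates the idea only; `model`, its call signature, and the `force_weight` value are assumptions, not the project's code.

```python
# Hedged sketch: joint energy/force loss via autograd (F = -dE/dr).
import torch
import torch.nn.functional as F

def energy_force_loss(model, pos, edge_index, batch,
                      energy_target, force_target, force_weight=10.0):
    pos = pos.clone().requires_grad_(True)   # track gradients w.r.t. positions
    energy = model(pos, edge_index, batch)   # predicted total energy per graph
    forces = -torch.autograd.grad(energy.sum(), pos, create_graph=True)[0]
    loss_e = F.mse_loss(energy, energy_target)
    loss_f = F.mse_loss(forces, force_target)
    return loss_e + force_weight * loss_f    # create_graph lets this backprop
```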