- Spatial nonstationarity, ubiquitous in spatial settings, is generally not accounted for or even mitigated in deep learning (DL) applications for spatial phenomena.
- We design a workflow to explore the impacts of nonstationarity on DL prediction performance:
  - DL models are trained with stationary sequential Gaussian simulation (SGS) realizations of the property of interest (we use porosity as an example), labeled with the variogram range.
  - DL prediction performance is then tested on nonstationary realizations to investigate the impacts of nonstationarity.
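Since each realization is labeled with its variogram range, it helps to see how a label relates to the experimental semivariogram. A minimal NumPy sketch (our own illustrative function, not from this repo; assumes unit grid spacing and lags along the x axis):

```python
import numpy as np

def semivariogram_x(field, max_lag):
    """Experimental semivariogram of a 2-D grid along the x axis.

    gamma(h) = 0.5 * mean((z(u + h) - z(u))**2) for unit-cell lags h.
    """
    return np.array([
        0.5 * np.mean((field[:, h:] - field[:, :-h]) ** 2)
        for h in range(1, max_lag + 1)
    ])

# White noise has no spatial correlation, so gamma(h) sits near the
# sill (1.0 for standard normal values) at every lag.
rng = np.random.default_rng(0)
gamma = semivariogram_x(rng.standard_normal((64, 64)), max_lag=10)
```

For a correlated SGS realization, gamma(h) would instead rise from near zero and level off at the sill around the variogram range, which is exactly the quantity the DL models are trained to predict.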
- The benchmark results are obtained by training a convolutional neural network (CNN) model, which is commonly used for computer vision (CV) tasks due to its performance in learning spatial hierarchies of features.
- We then explore Vision Transformer (ViT) and Swin Transformer (SwinT) models for spatial nonstationarity mitigation. The original ViT and SwinT architectures are modified for predictive (regression) tasks.
- We find that self-attention networks can effectively help mitigate spatial nonstationarity.
The following open source packages are mainly used in this project:
- NumPy
- pandas
- Matplotlib
- scikit-learn
- GSLIB
- TensorFlow
- Keras
- PyTorch
Please also install any other required packages if any are missing (see detailed package usage in the Jupyter notebooks).
```
.
├── Data Preparation.ipynb
├── train_CNN.ipynb
├── train_vision_transformers.ipynb
├── SwinT.py
└── ViT.py
```
`Data Preparation.ipynb` walks through the calculation of the training and testing data:
- All data are calculated using sequential Gaussian simulation (SGS).
- The notebook shows how to calculate SGS realizations using GSLIB-related packages.
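For intuition on what sequential Gaussian simulation does, here is a toy 1-D unconditional SGS in pure NumPy. This is a didactic stand-in, not GSLIB's `sgsim`: it uses simple kriging with an exponential covariance model and conditions each node on all previously simulated nodes.

```python
import numpy as np

def sgs_1d(n, vrange, seed=0):
    """Toy unconditional sequential Gaussian simulation on a 1-D grid.

    Visits nodes along a random path; at each node, simple kriging
    from the already-simulated nodes gives a conditional mean and
    variance, and the node value is drawn from that Gaussian.
    """
    rng = np.random.default_rng(seed)
    # exponential covariance model, unit sill, practical range `vrange`
    cov = lambda h: np.exp(-3.0 * np.abs(h) / vrange)
    sim = np.full(n, np.nan)
    for idx in rng.permutation(n):
        known = np.flatnonzero(~np.isnan(sim))
        if known.size == 0:
            mean, var = 0.0, 1.0                      # first node: prior N(0, 1)
        else:
            C = cov(known[:, None] - known[None, :])  # data-data covariance
            c = cov(known - idx)                      # data-node covariance
            w = np.linalg.solve(C + 1e-9 * np.eye(known.size), c)
            mean = w @ sim[known]
            var = max(1.0 - w @ c, 1e-9)              # kriging variance, clamped
        sim[idx] = mean + np.sqrt(var) * rng.standard_normal()
    return sim
```

The 2-D realizations used in this project follow the same logic but rely on GSLIB's optimized implementation with a search neighborhood, so use the notebook's workflow for actual data generation.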
The goals of this project include the following:
- Prepare the training data: stationary SGS realizations (see `Data Preparation.ipynb`).
- Train the CNN model (see `train_CNN.ipynb`) and the ViT and SwinT models (see `train_vision_transformers.ipynb`) with the training data. Once a model is fully trained, test its prediction performance with the testing data (nonstationary realizations).
- The CNN model is implemented with TensorFlow. `train_CNN.ipynb` shows how to create your own CNN model and train it with your training data.
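As a rough sketch of such a CNN regressor (hypothetical layer sizes chosen for illustration; the notebook's actual architecture may differ), assuming 64×64 single-channel realizations and one continuous output, the variogram range:

```python
import tensorflow as tf

def build_cnn(input_shape=(64, 64, 1)):
    """Minimal CNN regressor sketch: conv/pool feature extraction
    followed by a dense head with a single linear output unit."""
    return tf.keras.Sequential([
        tf.keras.layers.Input(shape=input_shape),
        tf.keras.layers.Conv2D(16, 3, activation="relu", padding="same"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(32, 3, activation="relu", padding="same"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dense(1),  # single continuous output (regression)
    ])

model = build_cnn()
model.compile(optimizer="adam", loss="mse")  # MSE suits a regression target
```

Training then follows the usual Keras pattern, `model.fit(x_train, y_train, ...)`, with the realizations as inputs and variogram ranges as targets.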
- The vision transformers are implemented with PyTorch. `train_vision_transformers.ipynb` demonstrates how to load the ViT/SwinT architectures (`ViT.py` & `SwinT.py`) and how to train the ViT/SwinT models. To visualize training progress, couple the training loop with a TensorBoard summary or a Weights & Biases (wandb) writer, as you prefer.
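The regression adaptation mentioned above amounts to replacing the classification head with a single-output linear head. A hedged PyTorch sketch, using a stand-in MLP backbone so the snippet is self-contained (the repo's `ViT.py`/`SwinT.py` define the real backbones):

```python
import torch
import torch.nn as nn

class RegressionHead(nn.Module):
    """Wraps any backbone that maps an image to an embedding and
    adds a single-output linear head for a continuous target."""
    def __init__(self, backbone, embed_dim):
        super().__init__()
        self.backbone = backbone
        self.head = nn.Linear(embed_dim, 1)  # one value: variogram range

    def forward(self, x):
        return self.head(self.backbone(x)).squeeze(-1)

# Stand-in backbone (hypothetical sizes); swap in the ViT/SwinT encoder.
backbone = nn.Sequential(nn.Flatten(), nn.Linear(64 * 64, 128), nn.ReLU())
model = RegressionHead(backbone, embed_dim=128)

# One training step with an MSE regression objective.
opt = torch.optim.Adam(model.parameters(), lr=1e-4)
x = torch.randn(4, 1, 64, 64)  # batch of 4 single-channel realizations
y = torch.rand(4)              # variogram-range labels
loss = nn.functional.mse_loss(model(x), y)
opt.zero_grad()
loss.backward()
opt.step()
```

Logging `loss.item()` each step to a TensorBoard `SummaryWriter` or wandb run slots naturally into this loop.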
- Generally, DL models need to be trained with a large amount of training data. Here, for easier demonstration, we randomly create a single sample (training data size = 1) for both training and validation. In practical usage, you should train the models with data generated in `Data Preparation.ipynb`.
This work is supported by the Digital Reservoir Characterization Technology (DIRECT) Industry Affiliate Program at The University of Texas at Austin.
👩‍🎓: Lei Liu
Email: leiliu@utexas.edu
GitHub: @LeiLiu
👨‍💻: Javier E. Santos
Email: jesantos@lanl.gov
GitHub: @je-santos
👩‍🏫: Maša Prodanović
Email: masha@utexas.edu
Personal Website: @Prodanović
👨‍🏫: Michael J. Pyrcz
Email: mpyrcz@austin.utexas.edu
GitHub: @GeostatsGuy