SuperBench is a benchmark dataset and evaluation framework for super-resolution (SR) tasks in scientific domains. It provides high-quality datasets and baseline models for evaluating and comparing SR methods in various scientific applications.
An overview of super-resolution for weather data
- Diverse datasets: SuperBench includes high-resolution fluid flow (left), cosmology (right), and weather datasets (middle) with dimensions up to
$2048\times2048$ .
- Evaluation metrics: The framework provides comprehensive evaluation metrics for assessing SR performance, including:
- Pixel-level difference
- Human-level perception
- Domain-motivated error metrics
- Baseline models: Pre-trained baseline models are provided to facilitate comparison with state-of-the-art methods.
- Extensible framework: SuperBench is designed to be easily extendable, allowing the inclusion of new datasets and baseline models.
-
Navier-Stokes Kraichnan Turbulence
- Two fluid flow datasets are simulated with Reynolds numbers of
$Re=16000$ and$Re=32000$ . The spatial resolution of this dataset is$2048\times2048$ . - Three variables are considered: two velocity variables in the
$x$ and$y$ directions, as well as with the vorticity field.
- Two fluid flow datasets are simulated with Reynolds numbers of
-
Cosmology Hydrodynamics
- The spatial resolution is
$2048\times2048$ . The temperature and baryon density are provided in log scale. - A corresponding low-resolution simulation data is provided for a realistic super-resolution task.
- The spatial resolution is
-
Weather data
- The weather data is modified from ERA5. The spatial resolution is
$720\times1440$ . - Three channels are considered: Kinetic Energy (KE) at 10m from the surface, the temperature at 2m from surface, and total column water vapor.
- The weather data is modified from ERA5. The spatial resolution is
Table 1: A summary of SuperBench dataset.
- Bicubic interpolation
- SRCNN,
- Paper: Image Super-Resolution Using Deep Convolutional Networks
- Venue: TPAMI, 2015 (the first paper using deep NNs)
- Subpixel CNN
- EDSR
- Paper: Enhanced deep residual networks for single image super-resolution
- Venue: CVPR, 2017 (NTIRE2017 Super-Resolution Challenge winner)
- WDSR
- Paper: Wide activation for efficient and accurate image super-resolution
- Venue: BMVC, 2019 (NTIRE 2018 Super-Resolution Challenge winner)
- FNO
- Paper: Fourier Neural Operator for Parametric Partial Differential Equations
- Venue: ICLR, 2021
- SwinIR
- Paper: Swinir: Image restoration using swin transformer
- venue: CVPR, 2021
To assess the performance of these methods, we employ three distinct types of metrics: pixel-level difference metrics; human-level perception metrics; and domain-motivated error metrics.
- Pixel-level difference:
- relative Forbenius norm error (RFNE)
- infinity norm (IN)
- peak signal-to-noise ratio (PSNR)
- Human-level perception:
- structural similarity index measure (SSIM)
- Domain-motivated error metrics:
- physics errors (e.g., continuity loss)
- Energy Specturm
- Anomaly Correlation Coefficient (ACC)
- ...
We have evaluated several state-of-the-art SR models on the SuperBench dataset across different degradation scenarios. Here are an example result on weather dataset.
We present the baseline performance of various SR models on weather data with bicubi-downsampling degradation. Figure 1 shows visual comparisons of the baseline model reconstructions against the ground truth high-resolution images. (a) and (b) are x8 and x16 up-sampling tasks, respectively. Table 2 below provides quantitative evaluation results for weather data in terms of RFNE, IN, PSNR and SSIM metrics.
Figure 1: An example snapshot of baseline performance on weather data.
Table 2: Results for weather data with bicubic down-sampling.
For more detailed results and analysis, please refer to our paper.
We welcome contributions from the scientific machine learning community. If you would like to contribute to the baseline models of SuperBench, please open an issue on the GitHub repository. You may either request to push code to src.models
or provide a link to the trained models with model details.
To use SuperBench, follow these steps:
- Clone the repository:
git clone https://github.com/erichson/SuperBench.git
- Install the required dependencies:
pip install -r requirements.txt
- Download the SuperBench datasets:
wget https://portal.nersc.gov/project/dasrepo/superbench/superbench_v1.tar
-
Run the baseline models on the datasets:
2.1. Generate
.sh
code# train all baseline models used in SuperBench paper python generate_sh.py
2.2. Run the
.sh
code# train all baseline models used in SuperBench paper sh train_all.sh
-
Evaluate the model performance:
# evaluate RFNE, IN, PSNR, SSIM and physics loss
sh eval.sh
# Regarding plots
sh eval_plot.sh
- Visualize the SR results
# for visualization in the paper
python presentation/plot_snapshots.py
#
For detailed model configurations, please refer to the the folder config
.
We also welcome dataset contributions from the community. If you would like to contribute to SuperBench, please open an issue on the GitHub repository and provide a link to your datasets with data details.
If you encounter any issues or have any questions, please open an issue on the GitHub repository.
SuperBench is released under the GNU General Public License v3.0.