AutoForce: A Python repository from amirhajibabaei

Introduction

This is a package for machine learning (ML) of the potential energy surface (PES) from costly ab initio calculations using the sparse Gaussian process regression (SGPR) algorithm. Ab initio calculations such as structure relaxation, AIMD, NEB, etc. can be substantially accelerated by fast ML models built on-the-fly. Moreover, the ML models built with smaller size of physical systems can be applied for simulations of larger systems which are impossible with direct ab initio methods. In principle, all the calculators supported by the atomic simulation environment (ASE) can be modeled.

Dependencies

required: numpy, scipy, pytorch, ase, mpi
conditional: mpi4py (see below)
optional: pymatgen, spglib, mendeleev, matplotlib, nglview, psutil, LAMMPS

mpi4py is only required if pytorch is not directly linked with mpi (i.e. torch.distributed.is_mpi_available() == False). Note that for coupling pytorch with mpi it should be compiled from the source. This package is regularly synced with the latest versions of ase and pytorch. Additional setting maybe needed for linking the ab initio calculators (VASP, GAUSSIAN, etc.) with ase (see this).

Installation

Clone the source code by

git clone https://github.com/amirhajibabaei/AutoForce.git

Go to the source code directory and install by

pip install .

Command line interface

For machine learning accelerated molecular dynamics, structure relaxation, etc (using VASP, GAUSSIAN, etc.) from the command line see theforce/cl/README.md.

Python API

It wraps ASE calculators:

from theforce.calculator.active import ActiveCalculator

# atoms = see ASE docs
# main_calc = see ASE calculators
# kernel = see the proceeding

calc = ActiveCalculator(calculator=main_calc)
atoms.set_calculator(calc)

# proceed with the desired calculations
# ...

For detailed information see theforce/calculator/README.md.

Optional coupling with LAMMPS

For running LAMMPS dynamics, it's python package should be installed. See the examples/LAMMPS folder.

Examples

For usage examples, see the examples/ folder.

Practical notes

On-the-fly ML

Ab initio calculations: The ab-initio calculators should be used only for single-point energy and force calculations. If on-the-fly ML fails, first and foremost, check if the underlying ab initio calculations (for the electronic structure) do converge.
ML models: The default settings are such that ML models are automatically saved and loaded in consecutive simulations. Thus check if the proper model is present in the working directory.
Initial structure for MD: Starting MD with a relaxed strucure (forces=0) is not advised. Either manually disturb the initial structure or use the rattle mechanism.
Structure optimization: Many structure relaxation algorithms depend on the forces history. With on-the-fly ML, every time the model is updated, forces suddenly change. The force discontinuity, if too large, may corrupt the optimizer. This can be avoided by reseting the optimizer history or training a preliminary model before relaxation.

Scalability

Distributed computing with MPI: The algorithm can use at most N (=number of atoms in the system) processes during MD. Using more processes can only speed-up the ML updates.
CUDA: Currently no GPU acceleration is implemented.
Species: Presence of more atomic species makes the simulation slower (often exponentially).

Citation

@article{PhysRevB.103.214102,
  title = {Sparse Gaussian process potentials: Application to lithium diffusivity in superionic conducting solid electrolytes},
  author = {Hajibabaei, Amir and Myung, Chang Woo and Kim, Kwang S.},
  journal = {Phys. Rev. B},
  volume = {103},
  issue = {21},
  pages = {214102},
  numpages = {7},
  year = {2021},
  month = {Jun},
  publisher = {American Physical Society},
  doi = {10.1103/PhysRevB.103.214102},
  url = {https://link.aps.org/doi/10.1103/PhysRevB.103.214102}
}