/ML_ODE

A Machine Learning model to solve an Ordinary Differential Equation

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

ML_ODE

ML_ODE provide the code to build a Machine Learning model for solving a Ordinary differential equation (ODE). An ODE problem consist in finding a function of time t f(t;x_0, Omega), dependent parametrically on x_0 and Omega, such that:

f'(t) = F(t,x,Omega)
f(t=0) = x_0

in this context x_0 is the value of the function at some initial time and Omega is a set of additional parameters useful to specify the time derivative of the problem.

As it is well known, a neural network can approximate (at least in principle) any real function, so: why not approximating the ODE solution with a Neural Network? In doing so we follow the seminal work Artificial Neural Networks for Solving Ordinary and Partial Differential Equations. The idea is further developed in a more modern paper and this is where we take inspiration from. Basically, we use the following ansatz for the solution:

f(t;x_0, Omega) = x_0 + (1-exp(-t))*NN(t, x_0, Omega)

This ansatz is pretty smart: it satisfies identically the boundary conditions and you can easily check how far f is from the actual solution by comparing its time derivative with the function F(t,Omega) of the problem. This idea is implemented in the loss function, which takes the following form:

L(t,x_0,Omega) = | f'(t;x_0, Omega) - F(t, f(t,_x0, Omega), Omega) |**2 * exp(-lambda*t)

Where lambda is a regularization constant to ensure that the training is more gradual. Once you have a loss function, it's all done: Tensorflow takes care of minimizing your loss function with respect to the weights of the NN.

You may wonder why we need a NN to solve an ODE, when we already have plenty of finite difference methods that works pretty well. The answer is simple: a NN is fast! This of course saves you a lot of computational resourches and at the same time you get the same degree of accuracy. Not convinced yet? A NN also provides a closed form expression for the solution, and once you have such expression you can differentiate it as many times as you wish: nice, isn't?

How it works

All the interesting code is wrapped in a single class, which provides you a (almost) ready to use model to fit. It is built to resemble a Keras model. You will find method fit() to train the model and method call() to evaluate the NN. You can use methods get_solution() or ODE_solution() to compute the solution that you are looking for. You can inherit the provided baseclass and customize your own model easily. You only need to specify the following:

  • The network architecture (by writing function set_model_properties()), given as a list of dense keras layers (list should be self._l_list)
  • Some hyperparameters of the networks: the dimensionality of the problem (n_vars), the number of variables Omega (n_params), the range in which every variable lives (in list constraints), as well as the range of time the model should be trained at. You might also set the value of a regularizer for the loss function, you can safely use a small value, such as 0.1~1. (See lambda parameter in eq.3 here for more information)
  • The function F(t,x,Omega). It should be a tensorflow function, which outputs a batch with shape (None, n_vars)

That's all. Not simple enough? Below you will find a simple example:

import ML_ODE
class my_ML_ODE(ML_ODE.ML_ODE_Basemodel):
	"Class for solving a ODE problem. Inherit from ML_ODE.ML_ODE_Basemodel and implements some methods."

	def set_model_properties(self):
		"This class should initialise some important paramters for the network"
		self.n_vars = 2	#number of variables in the problem
		self.n_params = 2 #number of parameters in the problem
		self.constraints = [(0.,1.),(-1., 1.), (-1., 1.), (-1., 1.),(-1., 1.)] #(t_range, x0_range, x1_range, *params_range)
		self.regularizer = 1. #regularizer for the loss function (not compulsory, default 0)
			#The regularizer act on the loss function L(t,x,Omega) s.t. L' = L * exp(regularizer*t): this is to ensure that earlier times are fitted with more accuracy

	def build_NN(self):
		"This class builds the fully connected NN architechture. Layer sequence should be provided by a list of layers (in self._l_list)"
		self._l_list.append(tf.keras.layers.Dense(128*2, activation=tf.nn.sigmoid) )
		self._l_list.append(tf.keras.layers.Dense(128, activation=tf.nn.sigmoid) )
		self._l_list.append(tf.keras.layers.Dense(self.n_vars, activation=tf.keras.activations.linear))
	
	
	def ODE_derivative(self, t, X, Omega): #actually z, D_L, Omegas (N,2)
		"""Here we write down the derivative of x: x' = F(t,x,Omega).
		Function should be written in pure tensorflow (input and outputs are tf tensors).
		Inputs:
			t (None,)					times
			X (None, n_vars)			values of the variables
			Omega (None, n_params)		parameters
		Output:
			x_prime (None,n_vars)/(None,)	derivative of the function
		"""
		return tf.math.multiply(X,Omega) #simple example where F_i(t,x,Omega) = x_i*Omega_i

You can find working examples in examples/example_model.py and examples/cosmological_model.py: the trained models are in folder examples/example_model and examples/cosmological_model.

Installing

It's super simple: you only need the file ML_ODE.py, which keeps all the code you need. Get it with:

wget https://raw.githubusercontent.com/stefanoschmidt1995/ML_ODE/main/ML_ODE.py

and you're done! You can import ML_ODE from every python script you like: that's all.

If you need also the examples, you might want to clone the whole repository

git clone https://github.com/stefanoschmidt1995/ML_ODE.git

Documentation

Every function is documented (I did my best to make it clear). Once you have instantiated a model, you can use the python help function for understanding the behaviour of the ML_ODE class or of a specific method:

help(ML_ODE)
help(ML_ODE.method_you_want_to_know_more_about)

Furthermore, you can have a look at the examples I provided.

If still something is not clear, or you wish to have more information, you can contact me at stefanoschmidt1995@gmail.com