/Brachistochrone

Using reinforcement learning and symbolic regression to discover the Brachistochrone

Primary LanguagePython

Using reinforcement learning and symbolic regression to discover the Brachistochrone

The video below shows a Reinforcement Learning agent trying to find increasingly faster paths from (0,0) to (3,-10) via only the influence of gravity. The orange curve is the optimal solution (the Brachistochrone), which the agent has no idea about. The best path found by the agent is the blue curve.

Screen.Recording.2023-10-13.at.18.38.46.mov

Brachistochrone.py just tries to guess faster paths from point A to point B under the influence of only gravity, and updates the fastest path via a matplotlib animation.

RL_Brachistrochrone.py implements a reinforcement learning environment with a neural network that tries to learn the best path under the the influence of only gravity, and also symbolic regression to produce candidate equations based on the coordinates of the best path found by the reinforcement learning agent. The paths are also shown in real-time via a matplotlib animation. An example output is shown in RL_Brachistrochrone.png

To Install

  1. Create and start virtual environment
python3 -m venv myvenv
source myvenv/bin/activate
  1. Install requirements.txt
cat requirements.txt | xargs -n 1 pip3 install

After installing requirements.txt, make the following file modifications (substituting your environment name with myvenv and your python version with python3.9):

myvenv/lib/python3.9/site-packages/rl/agents/ddpg.py

  • Change if hasattr(actor.output, '__len__') and len(actor.output) > 1: to if hasattr(actor.output, '__shape__') and len(actor.output) > 2:
  • Change if hasattr(critic.output, '__len__') and len(critic.output) > 1: to if hasattr(critic.output, '__shape__') and len(critic.output) > 2:
  • Comment out if self.uses_learning_phase:, critic_inputs += [K.learning_phase()], inputs += [self.training]

myvenv/lib/python3.9/site-packages/keras/engine/training_utils_v1.py

  • Comment out the following:
if len(data_shape) != len(shape):
    raise ValueError(
	"Error when checking "
	+ exception_prefix
	+ ": expected "
	+ names[i]
	+ " to have "
	+ str(len(shape))
	+ " dimensions, but got array with shape "
	+ str(data_shape)
    )

Then you can run the program:

python3 RL_Brachistochrone.py