/RL-Power-Distribution-for-power2heat

PyTorch implementation of a Monte Carlo Policy Gradient approach to learn an optimal policy for a power-to-heat device to distribute excess power in a dynamic environment.

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Watchers