/ZOiRL

Zeroth-Order Implicit Reinforcement Learning. 🏆 Winner of 2021 CityLearn RL research competition!

Primary LanguageJupyter NotebookMIT LicenseMIT

Winner of 2021 CityLearn Challenge [2022 challenge]

Zeroth-Order Implicit Reinforcement Learning for Sequential Decision Making in Distributed Control Systems

This repository provides a reference implementation of ZO-iRL. Website. Beat second-best team by 120%