/Pensieve-PPO

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, SAC, and support for both TensorFlow and PyTorch.

Primary LanguageDIGITAL Command LanguageBSD 2-Clause "Simplified" LicenseBSD-2-Clause