Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform

Cleanba is CleanRL-style implementation of DeepMind's Sebulba distributed training platform, but with a few different design choices to make distributed RL more reproducible and transparent to use.

Get started

Prerequisites:

Installation:

poetry install
poetry run pip install --upgrade "jax[cuda11_cudnn82]==0.4.8" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
poetry run python cleanba/cleanba_ppo.py --help
poetry run python cleanba/cleanba_ppo.py
poetry run python cleanba/cleanba_impala.py --help
poetry run python cleanba/cleanba_impala.py