Pinned issues
Issues
- 3
PPO Complex Obs/Action Space
#353 opened - 7
About PPO+Procgen code on Jax
#352 opened - 23
Reproduction of Muesli
#350 opened - 1
Add Polyak update to DQN
#346 opened - 2
- 1
- 15
Cleanrl for MARL
#330 opened - 3
- 1
Typo in c51.py
#324 opened - 0
- 0
Benchmark `dqn_jax.py` using CPU only
#317 opened - 22
- 2
DDPG JAX breaks with python ~3.7
#309 opened - 0
unable to render video in gitpod
#305 opened - 0
SAC Implementation Details
#304 opened - 1
cuda with SAC
#303 opened - 0
- 1
RLops Guide
#296 opened - 8
ppo+lstm train continuous environments
#290 opened - 1
Re-benchmarking refactored algorithms
#289 opened - 3
Requirments - requirements-pettingzoo.txt
#283 opened - 2
Problem with multi-agent atari
#280 opened - 2
TD3 policy noise bugs
#279 opened - 3
- 3
SAC discrete
#266 opened - 3
Multi-objective hyperparameter optimization
#265 opened - 2
Upgrade gym version to 0.26.1
#263 opened - 2
- 5
Add TQC to CleanRL
#258 opened - 1
- 3
DQN on MountainCar
#255 opened - 0
Adding unit tests
#252 opened - 6
- 1
Adding Double DQN
#250 opened - 1
RL Formulation
#249 opened - 7
Adding Hierarchical RL Algorithms
#248 opened - 2
Poetry can't install torch nightly
#247 opened - 5
Adding TRPO implementation
#245 opened - 3
AsyncVectorEnv
#244 opened - 5
A question about the `PPO` algorithm
#240 opened - 1
Replace cloud utilities w/ `torchx`
#239 opened - 0
- 0
JAX + C51
#221 opened - 1
JAX + DQN
#220 opened - 1
JAX + TD3
#219 opened - 0
JAX Integration with CleanRL
#218 opened - 1
Prototype TD3 with JAX
#216 opened - 2
PPO with Humanoid
#215 opened - 3
Adding Average Reward PPO proposal
#210 opened - 0
Remove the value function clipping
#208 opened