vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

PythonNOASSERTION

Pinned issues

Roadmap for CleanRL

#115 opened 6 months ago by vwxyzjn

Closed0

PPO improvements

#206 opened 6 months ago by vwxyzjn

Closed0

Issues

Any usage of poetry after installation: No module named 'tomli'
#455 opened a month ago by MarcoMeter
1
Correct handling of `termination` vs `truncation`?
#457 opened a month ago by ankile
1
Potential bugs in RecordEpisodeStatistics
#454 opened 2 months ago by williamd4112
0
Contributing PPO + Transformer-XL
#442 opened 4 months ago by MarcoMeter
3
Unable to allocate 26.3 GiB for an array
#453 opened 2 months ago by satyrmipt
1
KeyError: 'cleanba_ppo_envpool_procgen'
#452 opened 2 months ago by zhixiongzh
0
Docker image is out of date
#451 opened 2 months ago by myxik
0
Gymnasium and PyBullet envs: Problem with video capture
#450 opened 2 months ago by nikisim
0
Potential bug in PPO+RND?
#416 opened 2 months ago by roger-creus
2
Why normarlize advantage only for pg_loss but not for vf_loss?
#441 opened 5 months ago by zzhixin
1
Gymnasium Version Requirement May Need To Be Updated
#445 opened 4 months ago by plen1lune
1
clamp in C51
#443 opened 4 months ago by XinJingHao
0
Reproduction util: wrong command path
#440 opened 5 months ago by qgallouedec
0
Fail to record video
#384 opened 6 months ago by yxdydgithub
3
Why converting observation space to np.float32?
#438 opened 5 months ago by jamartinh
2
Question about the `noise-clip` parameter in DDPG.
#419 opened 5 months ago by helpingstar
2
Poor Evaluation Performance in PPO
#425 opened 7 months ago by sdpkjc
5
can't upload video running ppo_atari.py,wandb has no data
#436 opened 5 months ago by flypark666
8
Liberate the requirements.txt
#387 opened a year ago by ThijsvandenBerg
4
Video upload Issue - wandb
#397 opened 6 months ago by tbasaklar
4
numpy version issue with python 3.10
#417 opened 6 months ago by martin-nginio
1
Pyyaml error on poetry install
#418 opened 6 months ago by hom-bahrani
10
[BUG] Env does not reset when it's terminated
#432 opened 6 months ago by modanesh
2
expected sequence of length 8 at dim 1 (got 0)
#431 opened 6 months ago by flypark666
4
[BUG] Different final epsilon and evaluation epsilon for Atari implementations
#429 opened 6 months ago by pseudo-rnd-thoughts
0
get action in sac_continuous_action.py
#428 opened 6 months ago by zichunxx
2
Is is possible for SAC to support gymnasium too as TD3 and PPO ?
#421 opened 7 months ago by qiuruiyu
4
run and experiment_name from docs/advanced/resume-training are undefined
#399 opened 7 months ago by Melanol
5
Performance compared with SB3
#405 opened 7 months ago by qiuruiyu
2
How to do evaluation for example on PPO
#400 opened 7 months ago by qiuruiyu
3
Bug in actor loss for sac_continuous_action.py
#379 opened 8 months ago by terencenwz
5
SAC cannot converge to optimal policy
#410 opened 9 months ago by mahaozhe
3
Adding new dependencies for ManiSkill2 clean rl
#413 opened 9 months ago by StoneT2000
0
Clean Offline RL (CORL) moved to a new fork
#411 opened 9 months ago by vkurenkov
1
Poetry installation failure on master
#391 opened 10 months ago by smorad
4
Is action masking supported in PPO?
#402 opened a year ago by aqibsaeed
2
The GAE calculation in PPO continuous action
#393 opened a year ago by baixianger
6
Are you interested in adding MPO
#390 opened a year ago by Jogima-cyber
4
Duplicate code in README.md
#388 opened a year ago by helpingstar
0
Bug of cleanrl_utils/evals
#380 opened a year ago by sdpkjc
3
ManiSkill2 - Fast Visual RL robotics cleanrl baselines
#366 opened a year ago by StoneT2000
2
Buy the contributors a cup of coffee
#374 opened a year ago by ShuoZheLi
0
Typo in the requirements section of the README.md documentation
#372 opened a year ago by helpingstar
0
WheelValidationError of torch package when running poetry install
#367 opened a year ago by StoneT2000
1
Poetry install fails: WheelFileValidationError: Pytorch 1.12
#365 opened a year ago by samholt
2
code error when running dqn.py
#364 opened a year ago by jianzuo
4
Is SAC exploration-noise used?
#361 opened a year ago by StoneT2000
1
Bug in RND Intrinsic Reward Normalization
#360 opened a year ago by akarshkumar0101
1
LSTM weights should have separate orthogonal initializations for each gate
#358 opened a year ago by Jammf
0
The file 'ppo_memory_env_lstm.py' can't be found
#356 opened a year ago by leeivan1007
2