DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
PythonMIT
Pinned issues
Issues
- 0
[Question] RL traning could not reach convergence for a customised environment
#449 opened by SKYLEO98 - 1
- 7
[Bug]: Training suddenly stops at 25000 timesteps and Optuna optimization immediately exits in my custom environment
#441 opened by Silver812 - 1
[Bug]: Optimization log and optimal policy not in `--optimization-log-path` but in `--log-folder`
#444 opened by turbotimon - 1
[Bug]: Custom environment not found in gym registry, you maybe meant... error message
#443 opened by jiceline03 - 1
- 6
[Question] Does hyperparameter tuning support custom vectorized environments?
#439 opened by antoinedang - 1
- 4
[Bug]: video recording on Pybullet
#412 opened by guillaumepourcel - 2
[Question] Custom Eval Callback for train/optimize
#434 opened by kingjin94 - 5
[Question] RuntimeError: Unable to sample before the end of the first episode. We recommend choosing a value for learning_starts that is greater than the maximum number of timesteps in the environment.
#433 opened by moneypi - 1
- 1
- 0
- 4
- 2
- 9
- 0
- 1
[Bug]: Cannot enjoy due to error Cannot convert space of type Discrete(7). Please upgrade your code to gymnasium.
#425 opened by notshashwat - 2
[Feature Request] Call train from Python code
#423 opened by younik - 4
- 2
- 3
- 1
Plotting Script Improvement
#413 opened by VinayHajare - 1
- 2
- 7
[Bug]: enjoy panda policy in hugging face
#408 opened by zhixiongzh - 3
Error when using recommended plotting commands
#387 opened by ikamensh - 2
- 3
[Feature Request] Remove gym dependency
#398 opened by ernestum - 4
[Bug]: Kwargs in `record_video.py` not preserved by SB3 Vector Environment wrappers
#399 opened by anayebi - 0
`optuna.trial.Trial.suggest_uniform()` deprecated
#401 opened by alperenunlu - 0
#enhancment
#400 opened by alperenunlu - 2
[Feature Request] Outdated dockerfile
#391 opened by VineetTambe - 1
- 5
[Question] exported ONNX model does not result in same output as the original pytorch model
#394 opened by VineetTambe - 0
Half-Cheetah reward function
#393 opened by scakki - 3
Upgrade all MuJoco envs to v4
#385 opened by araffin - 5
[Enhancement] Deprecate --gym-packages argument
#390 opened by ernestum - 2
[Question] Should "continue training" load the vecnormalize.pkl as well as the model.zip?
#386 opened by xibeisiber - 1
[Question] Correct way of hyper parameter optimization for new algorithm?
#384 opened by tyler-ingebrand - 0
Add link to custom environments page in sb3 docs
#383 opened by Melanol - 1
[Question] No module named 'ae' when running train.py
#377 opened by Guosy0506 - 4
[Bug]: enjoy tries to access huggingface
#382 opened by Melanol - 5
- 0
- 5
- 4
[Question] full_action_space not working as expected
#375 opened by Bleyddyn - 1
- 3
error due to model and env observation space mismatch when I load pretrained agent
#369 opened by chang2727