TakuyaHiraokaNEC's Stars
AndroidArenaAgent/AndroidArena
edwhu/suika_rl
Gymnasium environment for Suika game
jiawei415/VCP
evgenii-nikishin/rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
danijar/dreamerv3
Mastering Diverse Domains through World Models
rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
araffin/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
t6-thu/H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
toshikwa/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
Howuhh/faster-trajectory-transformer
Implementation of Trajectory Transformer with attention caching and batched beam search