Pinned Repositories
spectral-rl2
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference
AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
CEER
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
Deep-Reinforcement-Learning
dreamerv2
Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
FlappyBird_DQN_with_target_network
DQN with freezing target network in tensorflow on pygame FlappyBird
spaceShooter_DQN
DQN with target network for spaceshooter
TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
initial-h's Repositories
initial-h/AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
initial-h/FlappyBird_DQN_with_target_network
DQN with freezing target network in tensorflow on pygame FlappyBird
initial-h/CEER
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023
initial-h/Deep-Reinforcement-Learning
initial-h/spaceShooter_DQN
DQN with target network for spaceshooter
initial-h/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
initial-h/CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
initial-h/dreamerv2
Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
initial-h/rl-rep
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference