initial-h

Shape the way you think.

Pinned Repositories

spectral-rl2
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference
Language: Python 26 4 18
AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
Language: Python 201 10 4845
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python 0 0 00
CEER
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023
Language: Python 4 1 00
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
Language: Python 0 0 00
Deep-Reinforcement-Learning
2 0 01
FlappyBird_DQN_with_target_network
DQN with a frozen target network in TensorFlow on pygame FlappyBird
Language: Python 11 2 01
modded-nanogpt-experimental
A simple fork for this pull request: https://github.com/KellerJordan/modded-nanogpt/pull/38
Language: Python 10
spaceShooter_DQN
DQN with target network for spaceshooter
Language: Python 1 0 00
TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
Language: Python 7.3k 454 4671.6k

initial-h/dreamerv2
PyTorch implementation of Dreamer-v2: a visual model-based RL algorithm.
Language: Python 0 0 00
initial-h/rl-rep
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference
Language: Python 00
initial-h/in-sample-deep-reinforcement-learning
Language: Python