Pinned Repositories
alien
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
alphazero_singleplayer
Single player Alpha Zero implementation
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
count_based_exploration_sr
Deep-Q-Learning
Tensorflow implementation of Deepminds dqn with double dueling networks
DeepReinforcementLearning
A replica of the AlphaZero methodology for deep reinforcement learning in Python
docker-pull-proxy
garage
A toolkit for reproducible reinforcement learning research
GATS
Surprising Negative Results for Generative Adversarial Tree Search
ZaneH1992's Repositories
ZaneH1992/alien
ZaneH1992/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
ZaneH1992/alphazero_singleplayer
Single player Alpha Zero implementation
ZaneH1992/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
ZaneH1992/count_based_exploration_sr
ZaneH1992/Deep-Q-Learning
Tensorflow implementation of Deepminds dqn with double dueling networks
ZaneH1992/DeepReinforcementLearning
A replica of the AlphaZero methodology for deep reinforcement learning in Python
ZaneH1992/docker-pull-proxy
ZaneH1992/garage
A toolkit for reproducible reinforcement learning research
ZaneH1992/GATS
Surprising Negative Results for Generative Adversarial Tree Search
ZaneH1992/kernel_gateway
Jupyter Kernel Gateway
ZaneH1992/muzero-general
MuZero
ZaneH1992/muzero4all
MuZero for CS234
ZaneH1992/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ZaneH1992/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
ZaneH1992/tensorlayer
Deep Learning and Reinforcement Learning Library for Scientists
ZaneH1992/tree-rl-adaptive
AlphaZero algorithm compatible with single-player deterministic environments with adaptive return normalization, optionally adaptive root return variance based MCTS iterations.