Pinned Repositories
deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
dqn
Applying the DQN-Agent from keras-rl to Starcraft 2 Learning Environment and modding it to to use the Rainbow-DQN algorithms.
DQN-Pytorch
DQN-tensorflow
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
gym
A toolkit for developing and comparing reinforcement learning algorithms.
machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
MyDiscor
Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
zhipeng-yang's Repositories
zhipeng-yang/deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
zhipeng-yang/dqn
Applying the DQN-Agent from keras-rl to Starcraft 2 Learning Environment and modding it to to use the Rainbow-DQN algorithms.
zhipeng-yang/DQN-Pytorch
zhipeng-yang/DQN-tensorflow
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
zhipeng-yang/gym
A toolkit for developing and comparing reinforcement learning algorithms.
zhipeng-yang/machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
zhipeng-yang/metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
zhipeng-yang/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
zhipeng-yang/MyDiscor
zhipeng-yang/Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
zhipeng-yang/prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
zhipeng-yang/RLMDP-TEAE
The model called Temporal difference Error-based Adaptive Exploration (TEAE) for solving MDP. The model is based on the reinforcement learning method for MDP (RLMDP) and addresses the limitations of traditional MDP solving methods.
zhipeng-yang/RLSUM
https://doi.org/10.1016/j.physa.2023.128699
zhipeng-yang/simple_dqn
Simple deep Q-learning agent.
zhipeng-yang/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
zhipeng-yang/tensorflow
An Open Source Machine Learning Framework for Everyone
zhipeng-yang/tqdm
A Fast, Extensible Progress Bar for Python and CLI