zhipeng-yang

Pinned Repositories

deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Language: Python
dqn
Applying the DQN-Agent from keras-rl to the Starcraft 2 Learning Environment and modding it to use the Rainbow-DQN algorithms.
Language: Python
DQN-Pytorch
Language: Python
DQN-tensorflow
TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning
Language: Python
gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language: Python
machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
Language: Jupyter Notebook
metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
Language: Python
mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language: C#
MyDiscor
Language: Jupyter Notebook
Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
Language: Jupyter Notebook

zhipeng-yang's Repositories

zhipeng-yang/deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Language: Python
zhipeng-yang/dqn
Applying the DQN-Agent from keras-rl to the Starcraft 2 Learning Environment and modding it to use the Rainbow-DQN algorithms.
Language: Python
zhipeng-yang/DQN-Pytorch
Language: Python
zhipeng-yang/DQN-tensorflow
TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning
Language: Python
zhipeng-yang/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language: Python
zhipeng-yang/machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
Language: Jupyter Notebook
zhipeng-yang/metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
Language: Python
zhipeng-yang/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language: C#
zhipeng-yang/MyDiscor
Language: Jupyter Notebook
zhipeng-yang/Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
Language: Jupyter Notebook
zhipeng-yang/prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
Language: Python
zhipeng-yang/RLMDP-TEAE
A model called Temporal difference Error-based Adaptive Exploration (TEAE) for solving Markov decision processes (MDPs). It builds on the reinforcement learning method for MDP (RLMDP) and addresses the limitations of traditional MDP solving methods.
Language: Python
zhipeng-yang/RLSUM
https://doi.org/10.1016/j.physa.2023.128699
Language: Python
zhipeng-yang/simple_dqn
Simple deep Q-learning agent.
Language: HTML
zhipeng-yang/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
zhipeng-yang/tensorflow
An Open Source Machine Learning Framework for Everyone
Language: C++
zhipeng-yang/tqdm
A Fast, Extensible Progress Bar for Python and CLI
Language: Python