Pinned Repositories
4_Room_World_Environment
This repository provides a simulation of 4-Room-World environment.
AI-blog
Accompanying repository for Let's make a DQN / A3C series.
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
cs231n.github.io
Public facing notes page
Deep-Reinforcement-Learning-for-Dynamic-Spectrum-Access
Using multi-agent Deep Q Learning with LSTM cells (DRQN) to train multiple users in cognitive radio to learn to share scarce resource (channels) equally without communication
Deep-RL-Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
DeepDoom
Capstone Project: Navigating 3D Environments Visually Using Distilled Hierarchical Deep Q-Networks
DeepLearningFlappyBird
Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).
tutorials
机器学习相关教程
Laojiang012's Repositories
Laojiang012/tutorials
机器学习相关教程
Laojiang012/4_Room_World_Environment
This repository provides a simulation of 4-Room-World environment.
Laojiang012/AI-blog
Accompanying repository for Let's make a DQN / A3C series.
Laojiang012/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Laojiang012/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Laojiang012/cs231n.github.io
Public facing notes page
Laojiang012/Deep-Reinforcement-Learning-for-Dynamic-Spectrum-Access
Using multi-agent Deep Q Learning with LSTM cells (DRQN) to train multiple users in cognitive radio to learn to share scarce resource (channels) equally without communication
Laojiang012/Deep-RL-Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Laojiang012/DeepLearningFlappyBird
Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).
Laojiang012/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
Laojiang012/hierarchical_IL_RL
Code for hierarchical imitation learning and reinforcement learning
Laojiang012/introRL
Intro to Reinforcement Learning (强化学习纲要)
Laojiang012/ipss-common
InterPSS common for lib, 3rd-party lib
Laojiang012/keras-ddpg
Python keras + tensorflow implementation of DDPG solving modified open gymAI pendulum-v0 environment
Laojiang012/learnlib
A free, open source Java library for automata learning algorithms.
Laojiang012/lihang-code
《统计学习方法》的代码实现
Laojiang012/matmtdc
MATMTDC is a free open source Matlab based program for performing dynamic analysis of AC/DC hybrid power systems. It is inspired by MatPower, a power flow and optimal power flow program in MATLAB. MATMTDC is an easy-to-use and easy-to-modify simulation tool for researchers and educators. Care has been taken to keep it well structured and easy to un
Laojiang012/ML-CS7641
Laojiang012/morvanzhou.github.io
莫烦Python Website source code
Laojiang012/move37
Coding Demos from the School of AI's Move37 Course
Laojiang012/options-hierarchical-rl
Laojiang012/reinforce-tf
a brief survey and implementations of deep reinforcement learning papers in tensorflow (in progress)
Laojiang012/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Laojiang012/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
Laojiang012/rl-intro-book-chinese
Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition
Laojiang012/tensorflow-windows-wheel
Tensorflow prebuilt binary for Windows
Laojiang012/tensorforce
Tensorforce: a TensorFlow library for applied reinforcement learning
Laojiang012/tensorlayer
Deep Learning and Reinforcement Learning Library for Scientists
Laojiang012/unreal
Reinforcement learning with unsupervised auxiliary tasks
Laojiang012/WZU-machine-learning-course
温州大学《机器学习》课程资料(代码、课件等)