Pinned Repositories
60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
A-Deep-Reinforcement-Learning-Network-for-Traffic-Light-Cycle-Control
A Deep Reinforcement Learning Network for Traffic Light Cycle Control
A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
aaai-goal
This is the codebase for the goal domain, implemented with Pygame.
Adaptive-Traffic-Signal-Control-Using-Reinforcement-Learning
This is an application exploiting principles of Deep Reinforcement Learning. The Deep Neural Network is trained to approximate the Bellman Equation (Q-Learning).
adeptRL
Reinforcement learning framework to accelerate research
agents
Efficient Batched Reinforcement Learning in TensorFlow
deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
gym-goal
OpenAI Gym environment for Robot Soccer Goal
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
landoufulxf's Repositories
landoufulxf/aaai-goal
This is the codebase for the goal domain, implemented with Pygame.
landoufulxf/burlap_caffe
landoufulxf/DAN
Code release of "Learning Transferable Features with Deep Adaptation Networks" (ICML 2015)
landoufulxf/darkforestGo
DarkForest, the Facebook Go engine.
landoufulxf/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
landoufulxf/DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
landoufulxf/DNC-tensorflow
A TensorFlow implementation of DeepMind's Differential Neural Computers (DNC)
landoufulxf/DQN_DDQN_Dueling_and_DDPG_Tensorflow
Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient (DDPG)
landoufulxf/dynaq
Exploring the Dyna-Q reinforcement learning algorithm
landoufulxf/EECS-349-Project
landoufulxf/evolution-strategies-starter
Starter code for Evolution Strategies
landoufulxf/imitation
Contains an implementation of "Trust Region Policy Optimization" (TRPO)
landoufulxf/learning-to-communicate
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
landoufulxf/learning-to-learn
Learning to Learn in TensorFlow
landoufulxf/Meta-RL
Implementation of Meta-RL A3C algorithm
landoufulxf/predictron
Tensorflow implementation of "The Predictron: End-To-End Learning and Planning"
landoufulxf/quadrotor
Quadrotor control, path planning and trajectory optimization
landoufulxf/RADPBook
Source code for examples in Book "Robust Adaptive Dynamic Programming"
landoufulxf/ReinforcementLearningCode
Codes for understanding Reinforcement Learning( updating... )
landoufulxf/research-method
论文写作与资料分享
landoufulxf/RL-movie-recommender
The purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.
landoufulxf/Self-Driving-Car-AI
A simple self-driving car AI python script using the deep Q-learning algorithm
landoufulxf/simple_dqn
Simple deep Q-learning agent.
landoufulxf/tensorflow-value-iteration-networks
TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper
landoufulxf/tensorflow_tutorials
From the basics to slightly more interesting applications of Tensorflow
landoufulxf/unreal
Reinforcement learning with unsupervised auxiliary tasks