reinforcement
There are 108 repositories under reinforcement topic.
opennars/opennars
OpenNARS for Research 3.0+
RITCHIEHuang/DeepRL_Algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Alfredvc/paac
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
learnables/cherry
A PyTorch Library for Reinforcement Learning Research
AI4Finance-Foundation/RLSolver
Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.
milanboers/rurel
Flexible, reusable reinforcement learning (Q learning) implementation in Rust
akolishchak/doom-net-pytorch
Reinforcement learning models in ViZDoom environment
jiseongHAN/Super-Mario-RL
🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄
bark-simulator/bark-ml
Gym environments and agents for autonomous driving.
lucylow/Deep-Learning-Mahjong---
Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game states
Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
wanyao1992/code_summarization_public
source code for 'Improving automatic source code summarization via deep reinforcement learning'
bsl546/energym
Energym is an open source building simulation library designed to test climate control and energy management strategies on buildings in a systematic and reproducible way.
BardOfCodes/DRL_in_CV
A course on Deep Reinforcement Learning in Computer Vision. Visit Website:
d3sm0/gym_pomdp
Gym-like extensions for POMDP
ktzsh/autonomous_drone_for_tracking
Autonomous Drone for Object Tracking
amrit3701/FreeCAD-Reinforcement
Reinforcement Workbench for FreeCAD
orrivlin/MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
mvrahden/reinforce-js
[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
A-Raafat/Torcs---Reinforcement-Learning-using-Q-Learning
Lane keeping assistant using Reinforcement learning
maxwell-nc/AndroidDexEncrypt
one key encryptor android classes.dex and repatch apk
srama2512/sidekicks
Sidekick Policy Learning for Active Visual Exploration (ECCV 2018)
dp770/aws_deepracer_worksheet
Worksheet and Utilities for AWS DeepRacer – one of the most exciting ways of building strong skills in reinforcement learning and through a hands-on approach. This repository offers: 1) Functionally-rich and flexible reward function 2) Utilities with Jupiter notes for Racing Line calculation and visualisation of track 3) Scripts to parse RoboMaker training and evaluation logs to CSV file 4) Sample Excel file for car behaviour analysis as well as designing and planning new reward curves 5) Coordinates and AWS DeepRacer tracks and images.
gsychi/64CrazyhouseDeepLearning
A deep learning Crazyhouse chess program that uses a Monte Carlo Tree Search (MCTS) based evaluation system and reinforcement to enhance its play style.
zhaoyl18/SEIKO
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
kekmodel/gym-tictactoe-zero
Tic Tac Toe with Alpha Zero method - My first work
Wanghailin2019/Learing-DRL-by-PyTorch-cookbook
本书作者是来自日本的Yutaro Ogawa(小川熊太郎),作者的github上源码是日文注释的,这个repository把它翻译成中文
rlturkiye/flying-cavalry
Flying Cavalry Project - Ucan Kavalye Projesi
calclavia/rl
Reinforcement learning algorithms implemented using Keras and OpenAI Gym
sohamghosh121/PacmanGym
Open AI Gym version of Berkeley AI Pacman with images as states
comeh/DeepLearningForMDPs
Some codes used for the numerical examples proposed in https://arxiv.org/abs/1812.05916
Beshario/DRL-Robotics-Arm
Robotic Arm learns to approach objects using Deep Reinforcement Learning
CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
heracleia/pyrdmp
Python Library for Dynamic Movement Primitives with Reinforcement Learning
CarsonScott/Dual-Process-Reinforcement
An intelligent agent that adaptively changes its thought processes to maximize cumulative reward
Xingtao/ReinforceLearningIntro
Reinforcement Learning Introduction - Selected Exercise Solutions & Experiment Code