reinforcement

There are 108 repositories under reinforcement topic.

opennars/opennars
OpenNARS for Research 3.0+
Language:Java386 48 22584
RITCHIEHuang/DeepRL_Algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Language:Python324 11 1541
Alfredvc/paac
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
Language:Python206 21 358
learnables/cherry
A PyTorch Library for Reinforcement Learning Research
Language:Python197 17 932
AI4Finance-Foundation/RLSolver
Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.
Language:Python139 4 2432
milanboers/rurel
Flexible, reusable reinforcement learning (Q learning) implementation in Rust
Language:Rust139 6 1117
akolishchak/doom-net-pytorch
Reinforcement learning models in ViZDoom environment
Language:Python132 8 719
jiseongHAN/Super-Mario-RL
🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄
Language:Python103 3 319
bark-simulator/bark-ml
Gym environments and agents for autonomous driving.
Language:Python96 5 3118
lucylow/Deep-Learning-Mahjong---
Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game states
Language:JavaScript79 4 111
Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Language:R79 7 2426
wanyao1992/code_summarization_public
source code for 'Improving automatic source code summarization via deep reinforcement learning'
Language:Python77 6 729
bsl546/energym
Energym is an open source building simulation library designed to test climate control and energy management strategies on buildings in a systematic and reproducible way.
Language:Python69 8 1112
BardOfCodes/DRL_in_CV
A course on Deep Reinforcement Learning in Computer Vision. Visit Website:
Language:HTML65 8 012
d3sm0/gym_pomdp
Gym-like extensions for POMDP
Language:Python56 5 315
ktzsh/autonomous_drone_for_tracking
Autonomous Drone for Object Tracking
Language:Python49 5 218
amrit3701/FreeCAD-Reinforcement
Reinforcement Workbench for FreeCAD
Language:Python48 8 11719
orrivlin/MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
Language:Python40 0 38
mvrahden/reinforce-js
[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
Language:TypeScript31 4 56
A-Raafat/Torcs---Reinforcement-Learning-using-Q-Learning
Lane keeping assistant using Reinforcement learning
Language:Python30 1 25
maxwell-nc/AndroidDexEncrypt
one key encryptor android classes.dex and repatch apk
Language:Java27 3 14
srama2512/sidekicks
Sidekick Policy Learning for Active Visual Exploration (ECCV 2018)
Language:Python26 1 24
dp770/aws_deepracer_worksheet
Worksheet and Utilities for AWS DeepRacer – one of the most exciting ways of building strong skills in reinforcement learning and through a hands-on approach. This repository offers: 1) Functionally-rich and flexible reward function 2) Utilities with Jupiter notes for Racing Line calculation and visualisation of track 3) Scripts to parse RoboMaker training and evaluation logs to CSV file 4) Sample Excel file for car behaviour analysis as well as designing and planning new reward curves 5) Coordinates and AWS DeepRacer tracks and images.
Language:Python23 4 18
gsychi/64CrazyhouseDeepLearning
A deep learning Crazyhouse chess program that uses a Monte Carlo Tree Search (MCTS) based evaluation system and reinforcement to enhance its play style.
Language:Python18 6 01
zhaoyl18/SEIKO
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
Language:Python17 3 30
kekmodel/gym-tictactoe-zero
Tic Tac Toe with Alpha Zero method - My first work
Language:Python16 3 04
Wanghailin2019/Learing-DRL-by-PyTorch-cookbook
本书作者是来自日本的Yutaro Ogawa(小川熊太郎），作者的github上源码是日文注释的，这个repository把它翻译成中文
Language:Jupyter Notebook16 1 03
rlturkiye/flying-cavalry
Flying Cavalry Project - Ucan Kavalye Projesi
Language:Python15 7 801
calclavia/rl
Reinforcement learning algorithms implemented using Keras and OpenAI Gym
Language:Python13 3 61
sohamghosh121/PacmanGym
Open AI Gym version of Berkeley AI Pacman with images as states
Language:Python13 3 29
comeh/DeepLearningForMDPs
Some codes used for the numerical examples proposed in https://arxiv.org/abs/1812.05916
Language:Python12 1 09
Beshario/DRL-Robotics-Arm
Robotic Arm learns to approach objects using Deep Reinforcement Learning
Language:C++11 1 10
CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
Language:Jupyter Notebook11 4 05
heracleia/pyrdmp
Python Library for Dynamic Movement Primitives with Reinforcement Learning
Language:Python11 7 64
CarsonScott/Dual-Process-Reinforcement
An intelligent agent that adaptively changes its thought processes to maximize cumulative reward
10 3 01
Xingtao/ReinforceLearningIntro
Reinforcement Learning Introduction - Selected Exercise Solutions & Experiment Code
Language:Haskell9 2 00

reinforcement

opennars/opennars

RITCHIEHuang/DeepRL_Algorithms

Alfredvc/paac

learnables/cherry

AI4Finance-Foundation/RLSolver

milanboers/rurel

akolishchak/doom-net-pytorch

jiseongHAN/Super-Mario-RL

bark-simulator/bark-ml

lucylow/Deep-Learning-Mahjong---

Nth-iteration-labs/contextual

wanyao1992/code_summarization_public

bsl546/energym

BardOfCodes/DRL_in_CV

d3sm0/gym_pomdp

ktzsh/autonomous_drone_for_tracking

amrit3701/FreeCAD-Reinforcement

orrivlin/MountainCar_DQN_RND

mvrahden/reinforce-js

A-Raafat/Torcs---Reinforcement-Learning-using-Q-Learning

maxwell-nc/AndroidDexEncrypt

srama2512/sidekicks

dp770/aws_deepracer_worksheet

gsychi/64CrazyhouseDeepLearning

zhaoyl18/SEIKO

kekmodel/gym-tictactoe-zero

Wanghailin2019/Learing-DRL-by-PyTorch-cookbook

rlturkiye/flying-cavalry

calclavia/rl

sohamghosh121/PacmanGym

comeh/DeepLearningForMDPs

Beshario/DRL-Robotics-Arm

CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

heracleia/pyrdmp

CarsonScott/Dual-Process-Reinforcement

Xingtao/ReinforceLearningIntro