model-free-rl
There are 45 repositories under model-free-rl topic.
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
dongminlee94/deep_rl
PyTorch implementation of deep reinforcement learning algorithms
yingchengyang/Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
erfanMhi/Deep-Reinforcement-Learning-CS285-Pytorch
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
dongminlee94/Samsung-DRL-Code
A repository for implementation of deep reinforcement learning lectured at Samsung
xlnwel/model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
manantomar/Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
dongminlee94/Reinforcement-Learning-Code
A repository for code of reinforcement learning algorithms with PyTorch
makaveli10/reinforcementLearning
Reinforcement Learning - Implementation of Exercises, algorithms from the book Sutton Barto and David silver's RL course in Python, OpenAI Gym.
AIR-DI/D2C
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
fedebotu/vision-cartpole-dqn
Implementation of the CartPole from OpenAI's Gym using only visual input for Reinforcement Learning control with DQN
paulvantieghem/curla
CURLA: CURL x CARLA -- Robust end-to-end Autonomous Driving by combining Contrastive Learning and Reinforcement Learning
mohammadzainabbas/Reinforcement-Learning-CS
💡 Grasp - Pick-and-place with a robotic hand 👨🏻💻
wiitt/DQN-Car-Racing
Implementation of DQN and DDQN algorithms for Playing Car Racing Game
MathPhysSim/FERMI_RL_Paper
The repo for the FERMI FEL paper using model-based and model-free reinforcement learning methods to solve a particle accelerator operation problem.
SSubhnil/BAC-DAC-gym
Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.
fedebotu/a3c-flappy-bird
Deep Reinforcement Learning implementation in Keras of an AI controlling the popular Flappy Bird videogame, using Asynchronous Advantage Actor Critic (A3C)
matakshay/DeepRL-for-Delayed-Rewards
Deep RL for Temporal Credit Assignment in decision processes with delayed rewards
narjesno/Reinforcement-Learning
This repository contains all of the Reinforcement Learning-related projects I've worked on. The projects are part of the graduate course at the University of Tehran.
cbanerji/Sample_efficient_RL.
Collection of codes pertaining to my research in model-free RL algorithms.
ShreeshaN/ReinforcementLearningTutorials
This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient
Brownwang0426/Reversal-Generative-Reinforcement-Learning
A simple model-free and value-function-free reinforcement learning model
koulanurag/variable-td3
Learning n-step actions for control tasks
shaheennabi/Reinforcement-or-Deep-Reinforcement-Learning-Practices-and-Mini-Projects
Reinforcement Learning (RL) 🤖! This repository is your hands-on guide to implementing RL algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. 🚀 Build smart agents, learn the math behind policies, and experiment with real-world applications! 🔥💡
xlnwel/atari_rl
Rainbow, IQN on atari games
DeTraffic/detraffic
A multi-agent deep reinforcement learning model to de-traffic our lives
LauraKarimova/Big_Data_Research_Project
The 3D bin packing problem is a combinatorial optimization problem that involves fitting a given set of items of various sizes into a container of a specific size such that the total volume of the items is as close to the volume of the container as possible.
esther-poniatowski/Massi2022
Testing different Reinforcement Learning strategies inspired by hippocampal replay for robotic navigation
kaushal1120/ReinforcementLearning
Implementation of model-free reinforcement learning to learn the correct policy in a Markov Decision Process. The input to the program is a Markov Decision Process and the program will learn what to do through experimentation.
NUS-LID/RENAULT
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
RanKyoto/RE-POMDP
Deep reinforcement learning for static noisy state feedback control with reward estimation
spktrm/porygon2
Porygon2 is a Node.js-based server for simulating Pokémon battles using the pkmn library. Easily set up the environment with the provided script for training and evaluation.