reward-shaping

There are 26 repositories under reward-shaping topic.

lcswillems/rl-starter-files
RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
Language:Python662 15 58186
salesforce/MultiHopKG
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Language:Jupyter Notebook301 15 2978
lcswillems/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Language:Python193 8 766
yining043/NeuOpt
This repo implements our paper, "Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt", which has been accepted at NeurIPS 2023.
Language:Jupyter Notebook39 1 13
csmile-1006/ARP
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
Language:Python33 1 11
kochlisGit/TraderNet-CRv2
TraderNet-CRv2 - Combining Deep Reinforcement Learning with Technical Analysis and Trend Monitoring on Cryptocurrency Markets
Language:Jupyter Notebook31 3 29
niksaz/dota2-expert-demo
Dota 2 bot that is trained by Deep RL with expert demonstrations
Language:Python30 3 24
holarissun/RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
Language:Python29 3 03
mike-gimelfarb/bayesian-reward-shaping
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
Language:Python23 3 05
awilliea/Risk-based_RL_for_Optimal_Trading_Execution
Language:Python20 2 07
jbakams/slimebot-volleyball
3D gym environments to train RL agents to play the Slime Volleyball game in 3 dimensions using Webots as simulator.
Language:Python17 2 00
tongzhoumu/DrS
Code for "DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks"
Language:Python16 2 01
zhang-zengjie/Reward4Driving
Benchmarks for risk-aware reward shaping of autonomous driving
Language:Jupyter Notebook5 1 00
Digitalized-Energy-Systems/opfgym
A gymnasium-compatible framework to create reinforcement learning (RL) environment for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
Language:Python3 1 270
Eugene1533/snake-ai-pytorch-complexification
Set of experiments of using weights of a previously trained network as prior knowledge for a more complicated one and reward providing.
Language:Python3
takato86/shaper
Reward shaping library
Language:Python3 2 00
dylwil3/reward-shaping
A lightweight package for running small experiments with reward shaping in reinforcement learning.
Language:Jupyter Notebook1 2 50
PoldervaartS/RLRLGym
Reinforcement Learning Exploration of PPO and training methods in Rocket League
Language:Python1 3 00
brianwade1/Gym_QLearning_and_RewardShaping
This repo demonstrates basic Q-learning for the Mountain Car Gym environment. It also shows how reward shaping can result in faster training of the agent.
Language:Python0 1 00
CAmayoral/hello-world
BAT Basic Attention Token, Brave, Uphold, DAPP, Cryptocurrenies.
0 1 00
IngyN/macsrl
Project for a semi-centralized logic-based MARL reward shaping method that is scalable in the number of agents and evaluates it in multiple scenarios
Language:Jupyter Notebook0 1 00
johnHostetter/PolicyPrep
Prepares policies from data to model; focuses on hierarchical tasks and applies reward shaping to handle delayed reward signals.
Language:Python0 2 00
lara-martin/StoryPlot-RewardShaping
Code from the IJCAI 2019 paper "Controllable Neural Story Plot Generation via Reward Shaping"
Language:Python0 1 01
RedLeader962/Une-intuition-sur-RUDDER
Ressources pour la présentation orale: "Une intuition sur RUDDER (Return Decomposition for Delayed Rewards)"
0 2 00
ChiragRadhakrishna43-7/Pacman_MultiAgents
Pacman games with multi agents. Evaluating the performance of Pacman and the ghosts.
Language:Python1 0
tanu10/RLPeri
[AAAI 2024] RLPeri: Accelerating Visual Perimetry Test with Reinforcement Learning and Convolutional Feature Extraction
Language:Python2 0

reward-shaping

lcswillems/rl-starter-files

salesforce/MultiHopKG

lcswillems/torch-ac

yining043/NeuOpt

csmile-1006/ARP

kochlisGit/TraderNet-CRv2

niksaz/dota2-expert-demo

holarissun/RewardShifting

mike-gimelfarb/bayesian-reward-shaping

awilliea/Risk-based_RL_for_Optimal_Trading_Execution

jbakams/slimebot-volleyball

tongzhoumu/DrS

zhang-zengjie/Reward4Driving

Digitalized-Energy-Systems/opfgym

Eugene1533/snake-ai-pytorch-complexification

takato86/shaper

dylwil3/reward-shaping

PoldervaartS/RLRLGym

brianwade1/Gym_QLearning_and_RewardShaping

CAmayoral/hello-world

IngyN/macsrl

johnHostetter/PolicyPrep

lara-martin/StoryPlot-RewardShaping

RedLeader962/Une-intuition-sur-RUDDER

ChiragRadhakrishna43-7/Pacman_MultiAgents

tanu10/RLPeri