Create custom rewards handler

Question

Opened this issue a year ago · 1 comment

Why

As a user of pyCMO, I want to be able to specify different reward models for my scenarios so that I can train RL agents.

Given that we currently only export the player side's total score as the reward, when we implement a way for users to specify a reward model, then we get closer to being able to train RL agents.

One idea is to create a custom RewardHandler class that is passed into CMOEnv and calculates the reward from the current observation.
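A minimal sketch of what such a handler could look like. Everything here is an assumption for illustration: the RewardHandler interface, the ScoreDeltaReward subclass, and the idea that the observation is a mapping with a "score" entry are all hypothetical and not part of pyCMO's current API. How CMOEnv would accept and call the handler is left open.

```python
from abc import ABC, abstractmethod
from typing import Any, Mapping, Optional


class RewardHandler(ABC):
    """Hypothetical interface: turns an observation into a scalar reward."""

    def reset(self) -> None:
        """Clear per-episode state. Stateless handlers can leave this as-is."""

    @abstractmethod
    def compute(self, observation: Mapping[str, Any]) -> float:
        """Return the reward for the current observation."""


class ScoreDeltaReward(RewardHandler):
    """Example handler: reward is the change in score since the last step.

    Assumes the observation exposes the side's score under `score_key`;
    the real pyCMO observation layout may differ.
    """

    def __init__(self, score_key: str = "score") -> None:
        self.score_key = score_key
        self._previous: Optional[float] = None

    def reset(self) -> None:
        self._previous = None

    def compute(self, observation: Mapping[str, Any]) -> float:
        score = float(observation[self.score_key])
        # The first step of an episode has no previous score to compare with.
        delta = 0.0 if self._previous is None else score - self._previous
        self._previous = score
        return delta


handler = ScoreDeltaReward()
first = handler.compute({"score": 10})   # no previous score yet
second = handler.compute({"score": 25})  # score rose by 15
```

Under this design, the environment would call reset() at the start of each episode and compute() once per step, so users could swap reward models without touching the environment code.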

Answer 1 · 2023-12-11T23:10:54.000Z

Gymnasium already provides reward wrappers (gymnasium.RewardWrapper), which may cover this use case.