Multi-task training

Question

Maxwell2017 opened this issue 4 years ago · 1 comment

Maxwell2017 commented 4 years ago

In multi-task training, how do you handle rewards across different tasks? I see CLIPPED_REWARDS in the code. Are the rewards from different types of tasks added up and then backpropagated? Could you point me to where this happens in the code? @lespeholt

Answer 1 · 2021-01-18T11:36:25.000Z

Rewards are not added across tasks. You just train on the games in a round-robin fashion. One can do much better, though; please see https://arxiv.org/abs/1809.04474
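
To make the round-robin idea concrete, here is a minimal sketch, not the repository's actual code. It assumes that "round robin" means each update draws from one game at a time, cycling through the games in order, and that rewards are clipped to [-1, 1] per task before use. The function names, the game names, the reward values, and the clipping range are all hypothetical; CLIPPED_REWARDS in the repository may behave differently.

```python
import itertools


def clip_reward(reward, low=-1.0, high=1.0):
    """Clamp a single reward into [low, high] (assumed clipping range)."""
    return max(low, min(high, reward))


def round_robin_schedule(tasks, num_updates):
    """Return the task used for each update, cycling through tasks in order."""
    task_cycle = itertools.cycle(tasks)
    return [next(task_cycle) for _ in range(num_updates)]


def per_update_rewards(raw_rewards, schedule):
    """Pair each update with the clipped rewards of its own task only.

    Rewards from different tasks are never summed together here.
    """
    return [(task, [clip_reward(r) for r in raw_rewards[task]]) for task in schedule]


# Hypothetical raw rewards for three placeholder games.
raw_rewards = {
    "game_a": [0.5, 10.0],
    "game_b": [-7.0, 0.0],
    "game_c": [1.0, 300.0],
}

schedule = round_robin_schedule(list(raw_rewards), num_updates=4)
updates = per_update_rewards(raw_rewards, schedule)

for task, rewards in updates:
    print(task, rewards)
```

In this sketch each update sees rewards from a single game, so the different reward scales only interact through the shared parameters, never through a summed reward signal.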