tegg89/rl-teacher-Pytorch

Deep reinforcement learning from human preferences in PyTorch (WIP)

Python · MIT

1 Issue
8 Stargazers
4 Watchers

Watchers

Caesar-T
mlpanda
paper2code-bot
@paper2code
tegg89
EpiSys Science
