HumanCompatibleAI/population-irl

(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards

Python · MIT

Readme
18 Issues
25 Stargazers
9 Watchers

Watchers

AdamGleave (FAR AI)
Discordius
eemailme
jenchen (Berkeley, CA)
jhcloos
justicelee
paper2code-bot (@paper2code)
rohinmshah (@HumanCompatibleAI)
xuweijiezds
