yyds-xtt/UCB_MARL

The simulation codes of a provably efficient multi-agent reinforcement learning algorithm with a near-optimal regret bound in industrail data collection.

Python

Readme
0Issues
1Stargazer
0Watchers

Stargazers

yc-whu

Contact site admin: Geeks.