Extending Competitive Mirror Descent to a multi-agent, reinforcement learning setting.
Primary LanguageJupyter Notebook