/MADRQN

Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator

Primary LanguagePython

Watchers