/MATD3

Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships

Primary LanguagePython

Stargazers