-
Install SMARTS following the instructions in
./SMARTS
folder -
Download the original data from https://jbox.sjtu.edu.cn/v/link/view/75a0931a222347e1ba2e0441407f4a1f and place it under
./ngsim
folder -
Build NGSIM scenario with
scl build --clean ./ngsim
-
Generate expert demonstrations with
python example_expert_generation.py
-
Test rollout with
python example_rollout.py
我们实现了使用GAIL+TD3算法来实现驾驶决策任务,同时并使用Behavior Clone算法先预训练TD3模型,使得生成出来的决策分布和专家分布相近,减缓分布差距过大导致的梯度消失问题。此外还引入梯度惩罚和梯度裁剪来使得训练更加稳定。
快车道效果
慢车道效果