/CACC-and-EMS-for-ERELV-platoon

Extended-range electric logistics van platoon; Cooperative adaptive cruise control; Distributed model predictive control; Energy management strategy; Multi-agent deep deterministic policy gradient

Primary LanguageJupyter Notebook

Collaborative adaptive cruise control and energy management strategy for extended-range electric logistics van platoon

https://journals.sagepub.com/doi/10.1177/09544070231193187

This paper improves the economy of the extended-range electric logistics van (ERELV) platoon from two aspects of cooperative adaptive cruise control (CACC) and energy management strategy (EMS).

Based on the DMPC algorithm, an Eco-CACC controller with the optimization goals of stability, comfort and energy-saving is designed.

The multi-agent deep reinforcement learning (MADRL) algorithm is used to solve the EMS of the ERELV platoon.

DQN, DDPG and MADDPG were used as energy management strategies for extended range electric logistics van platoon.

Happy to answer any questions you have. Please email us at wg@njust.edu.cn

Eco-CACC and MADRL EMS overall control framework

论文图片

Eco-CACC based on DMPC

A single-point optimization problem is defined on each vehicle node in the platoon, and the model prediction optimization problem of all nodes needs to be solved at each optimization moment. Using the domain node information obtained by the PLF communication topology, each sub-problem is optimized to obtain the control input of the vehicle.

image

EMS for ERELV platoon

The picture shows the basic framework of EMS based on deep reinforcement learning (DRL).The environment model includes hybrid power system and driving environment, and the agent module contains learning algorithm. Through the interaction between agent and environment to optimize network parameters, agent will learn to output optimal actions to the environment to maximize the cumulative reward.

image

The state transition of ERELV has the Markov property and generates a corresponding energy consumption at each time step. Since each agent cannot know the complete environment state, the EMS of the ERELV platoon can be modeled as a partially observable POMDP. The task types applied by MADRL algorithm are divided into fully cooperative, fully competitive and hybrid types. For the EMS of the ERELV platoon, each vehicle node needs to jointly explore the optimal control behavior under different vehicle states, so it is a fully cooperative MADRL problem.

image