•  
  •  
 

Journal of System Simulation

Abstract

Abstract: The capacitated vehicle routing problem (CVRP) is a well-known combinatorial optimization challenge recognized as NP-hard due to its significant complexity. Building upon existing research, this paper introduces a novel end-to-end deep reinforcement learning approach based on a multi-pointer Transformer to tackle the CVRP. The proposed algorithm employs an invertible residual network in the encoder to encode input features, effectively reducing memory consumption. In the decoder, a multipointer network determines the probability distribution of solutions. To further enhance the performance of CVRP solutions, the algorithm leverages the symmetry in combinatorial optimization by implementing multi-trajectory parallel processing during both training and inference phases. Additionally, an enhanced contextual embedding method is utilized, and the model is trained using an improved reinforcement learning algorithm. Experimental results demonstrate that the proposed model strikes the best balance between solving speed and quality with lower memory usage compared to current classic heuristic algorithms and other deep learning approaches.

First Page

2177

Last Page

2187

CLC

TP18

Recommended Citation

Jiang Ming, He Tao. Solving the Vehicle Routing Problem Based on Deep Reinforcement Learning[J]. Journal of System Simulation, 2025, 37(9): 2177-2187.

Corresponding Author

He Tao

DOI

10.16182/j.issn1004731x.joss.24-0432

Share

COinS