Journal of System Simulation
Abstract
Abstract: The capacitated vehicle routing problem (CVRP) is a well-known combinatorial optimization challenge recognized as NP-hard due to its significant complexity. Building upon existing research, this paper introduces a novel end-to-end deep reinforcement learning approach based on a multi-pointer Transformer to tackle the CVRP. The proposed algorithm employs an invertible residual network in the encoder to encode input features, effectively reducing memory consumption. In the decoder, a multipointer network determines the probability distribution of solutions. To further enhance the performance of CVRP solutions, the algorithm leverages the symmetry in combinatorial optimization by implementing multi-trajectory parallel processing during both training and inference phases. Additionally, an enhanced contextual embedding method is utilized, and the model is trained using an improved reinforcement learning algorithm. Experimental results demonstrate that the proposed model strikes the best balance between solving speed and quality with lower memory usage compared to current classic heuristic algorithms and other deep learning approaches.
Recommended Citation
Jiang, Ming and He, Tao
(2025)
"Solving the Vehicle Routing Problem Based on Deep Reinforcement Learning,"
Journal of System Simulation: Vol. 37:
Iss.
9, Article 1.
DOI: 10.16182/j.issn1004731x.joss.24-0432
Available at:
https://dc-china-simulation.researchcommons.org/journal/vol37/iss9/1
First Page
2177
Last Page
2187
CLC
TP18
Recommended Citation
Jiang Ming, He Tao. Solving the Vehicle Routing Problem Based on Deep Reinforcement Learning[J]. Journal of System Simulation, 2025, 37(9): 2177-2187.
DOI
10.16182/j.issn1004731x.joss.24-0432
Included in
Artificial Intelligence and Robotics Commons, Computer Engineering Commons, Numerical Analysis and Scientific Computing Commons, Operations Research, Systems Engineering and Industrial Engineering Commons, Systems Science Commons