arXiv Analytics

Sign in

arXiv:2112.12545 [math.OC]AbstractReferencesReviewsResources

A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone

Aigerim Bogyrbayeva, Taehyun Yoon, Hanbum Ko, Sungbin Lim, Hyokun Yun, Changhyun Kwon

Published 2021-12-22, updated 2022-12-05Version 3

Reinforcement learning has recently shown promise in learning quality solutions in many combinatorial optimization problems. In particular, the attention-based encoder-decoder models show high effectiveness on various routing problems, including the Traveling Salesman Problem (TSP). Unfortunately, they perform poorly for the TSP with Drone (TSP-D), requiring routing a heterogeneous fleet of vehicles in coordination -- a truck and a drone. In TSP-D, the two vehicles are moving in tandem and may need to wait at a node for the other vehicle to join. State-less attention-based decoder fails to make such coordination between vehicles. We propose a hybrid model that uses an attention encoder and a Long Short-Term Memory (LSTM) network decoder, in which the decoder's hidden state can represent the sequence of actions made. We empirically demonstrate that such a hybrid model improves upon a purely attention-based model for both solution quality and computational efficiency. Our experiments on the min-max Capacitated Vehicle Routing Problem (mmCVRP) also confirm that the hybrid model is more suitable for the coordinated routing of multiple vehicles than the attention-based model. The proposed model demonstrates comparable results as the operations research baseline methods.

Related articles: Most relevant | Search more
arXiv:2403.14013 [math.OC] (Published 2024-03-20)
Towards a connection between the capacitated vehicle routing problem and the constrained centroid-based clustering
arXiv:2005.03186 [math.OC] (Published 2020-05-07)
An Optimal Control Theory for the Traveling Salesman Problem and Its Variants
arXiv:2311.08827 [math.OC] (Published 2023-11-15)
A Deep Reinforcement Learning Approach to Efficient Distributed Optimization