A Deep Reinforcement-Learning-Based Route Optimization Model for Multi-Compartment Cold Chain Distribution

Cold chain logistics is crucial in ensuring food quality and safety in modern supply chains. The required temperature control systems increase operational costs and environmental impacts compared to conventional logistics. To reduce these costs while maintaining service quality in real-world distrib...

Full description

Saved in:
Bibliographic Details
Main Authors: Jingming Hu, Chong Wang
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/13/13/2039
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Cold chain logistics is crucial in ensuring food quality and safety in modern supply chains. The required temperature control systems increase operational costs and environmental impacts compared to conventional logistics. To reduce these costs while maintaining service quality in real-world distribution scenarios, efficient route planning is essential, particularly when products with different temperature requirements need to be delivered together using multi-compartment refrigerated vehicles. This substantially increases the complexity of the routing process. We propose a novel deep reinforcement learning approach that incorporates a vehicle state encoder for capturing fleet characteristics and a dynamic vehicle state update mechanism for enabling real-time vehicle state updates during route planning. Extensive experiments on a real-world road network show that our proposed method significantly outperforms four representative methods. Compared to a recent ant colony optimization algorithm, it achieves up to a 6.32% reduction in costs while being up to 1637 times faster in computation.
ISSN:2227-7390