Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning

Marine voyage optimization determines the optimal route and speed to ensure timely arrival. The problem becomes particularly complex when incorporating a dynamic environment, such as future expected weather conditions along the route and unexpected disruptions. This study explores two model-free Dee...

Full description

Saved in:

Bibliographic Details
Main Authors:	Charilaos Latinopoulos, Efstathios Zavvos, Dimitrios Kaklis, Veerle Leemen, Aristides Halatsis
Format:	Article
Language:	English
Published:	MDPI AG 2025-04-01
Series:	Journal of Marine Science and Engineering
Subjects:	voyage optimization weather routing deep reinforcement learning maritime energy efficiency
Online Access:	https://www.mdpi.com/2077-1312/13/5/902
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849711852940951552
author	Charilaos Latinopoulos Efstathios Zavvos Dimitrios Kaklis Veerle Leemen Aristides Halatsis
author_facet	Charilaos Latinopoulos Efstathios Zavvos Dimitrios Kaklis Veerle Leemen Aristides Halatsis
author_sort	Charilaos Latinopoulos
collection	DOAJ
description	Marine voyage optimization determines the optimal route and speed to ensure timely arrival. The problem becomes particularly complex when incorporating a dynamic environment, such as future expected weather conditions along the route and unexpected disruptions. This study explores two model-free Deep Reinforcement Learning (DRL) algorithms: (i) a Double Deep Q Network (DDQN) and (ii) a Deep Deterministic Policy Gradient (DDPG). These algorithms are computationally costly, so we split optimization into an offline phase (costly pre-training for a route) and an online phase where the algorithms are fine-tuned as updated weather data become available. Fine tuning is quick enough for en-route adjustments and for updating the offline planning for different dates where the weather might be very different. The models are compared to classical and heuristic methods: the DDPG achieved a 4% lower fuel consumption than the DDQN and was only outperformed by Tabu Search by 1%. Both DRL models demonstrate high adaptability to dynamic weather updates, achieving up to 12% improvement in fuel consumption compared to the distance-based baseline model. Additionally, they are non-graph-based and self-learning, making them more straightforward to extend and integrate into future digital twin-driven autonomous solutions, compared to traditional approaches.
format	Article
id	doaj-art-d445a7315a6a4f5d8e53f741f352de60
institution	DOAJ
issn	2077-1312
language	English
publishDate	2025-04-01
publisher	MDPI AG
record_format	Article
series	Journal of Marine Science and Engineering
spelling	doaj-art-d445a7315a6a4f5d8e53f741f352de602025-08-20T03:14:31ZengMDPI AGJournal of Marine Science and Engineering2077-13122025-04-0113590210.3390/jmse13050902Marine Voyage Optimization and Weather Routing with Deep Reinforcement LearningCharilaos Latinopoulos0Efstathios Zavvos1Dimitrios Kaklis2Veerle Leemen3Aristides Halatsis4VLTN BV, De Keyserlei 58-60 bus 19, 2018 Antwerp, BelgiumVLTN BV, De Keyserlei 58-60 bus 19, 2018 Antwerp, BelgiumDanaos Shipping Co., Ltd., 14 Akti Kondyli, 18545 Piraeus, GreeceVLTN BV, De Keyserlei 58-60 bus 19, 2018 Antwerp, BelgiumVLTN BV, De Keyserlei 58-60 bus 19, 2018 Antwerp, BelgiumMarine voyage optimization determines the optimal route and speed to ensure timely arrival. The problem becomes particularly complex when incorporating a dynamic environment, such as future expected weather conditions along the route and unexpected disruptions. This study explores two model-free Deep Reinforcement Learning (DRL) algorithms: (i) a Double Deep Q Network (DDQN) and (ii) a Deep Deterministic Policy Gradient (DDPG). These algorithms are computationally costly, so we split optimization into an offline phase (costly pre-training for a route) and an online phase where the algorithms are fine-tuned as updated weather data become available. Fine tuning is quick enough for en-route adjustments and for updating the offline planning for different dates where the weather might be very different. The models are compared to classical and heuristic methods: the DDPG achieved a 4% lower fuel consumption than the DDQN and was only outperformed by Tabu Search by 1%. Both DRL models demonstrate high adaptability to dynamic weather updates, achieving up to 12% improvement in fuel consumption compared to the distance-based baseline model. Additionally, they are non-graph-based and self-learning, making them more straightforward to extend and integrate into future digital twin-driven autonomous solutions, compared to traditional approaches.https://www.mdpi.com/2077-1312/13/5/902voyage optimizationweather routingdeep reinforcement learningmaritime energy efficiency
spellingShingle	Charilaos Latinopoulos Efstathios Zavvos Dimitrios Kaklis Veerle Leemen Aristides Halatsis Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning Journal of Marine Science and Engineering voyage optimization weather routing deep reinforcement learning maritime energy efficiency
title	Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning
title_full	Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning
title_fullStr	Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning
title_full_unstemmed	Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning
title_short	Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning
title_sort	marine voyage optimization and weather routing with deep reinforcement learning
topic	voyage optimization weather routing deep reinforcement learning maritime energy efficiency
url	https://www.mdpi.com/2077-1312/13/5/902
work_keys_str_mv	AT charilaoslatinopoulos marinevoyageoptimizationandweatherroutingwithdeepreinforcementlearning AT efstathioszavvos marinevoyageoptimizationandweatherroutingwithdeepreinforcementlearning AT dimitrioskaklis marinevoyageoptimizationandweatherroutingwithdeepreinforcementlearning AT veerleleemen marinevoyageoptimizationandweatherroutingwithdeepreinforcementlearning AT aristideshalatsis marinevoyageoptimizationandweatherroutingwithdeepreinforcementlearning

Marine Voyage Optimization and Weather Routing with Deep Reinforcement Learning

Similar Items