Enhancing Efficiency in Collision Avoidance: A Study on Transfer Reinforcement Learning in Autonomous Ships’ Navigation

Wang, Xinrui; Jin, Yan

contributor author	Wang, Xinrui
contributor author	Jin, Yan
date accessioned	2024-12-24T18:53:43Z
date available	2024-12-24T18:53:43Z
date copyright	7/12/2024 12:00:00 AM
date issued	2024
identifier issn	2770-3495
identifier other	aoje_3_031019.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4302939
description abstract	Collision avoidance in ships and robotic vehicles exemplifies a complex work process that necessitates effective scenario recognition and precise movement decision-making. Machine learning methods addressing such work processes generally involve learning from scratch, which is not only time-consuming but also demands significant computational resources. Transfer learning emerges as a potent strategy to enhance the efficiency of these engineering work processes by harnessing previously acquired knowledge from analogous tasks, thereby streamlining the learning curve for new challenges. This research delves into two critical questions central to optimizing transfer reinforcement learning for the work process of collision avoidance: (1) Which process features can be successfully transferred across varying work processes? (2) What methodologies support the efficient and effective transfer of these features? Our study employs simulation-based experiments in ship collision avoidance to address these questions, chosen for their intrinsic complexity and the varied feature recognition it demands. We investigate and compare two transfer learning techniques—feature extraction and finetuning—utilizing a lightweight convolutional neural network (CNN) model pretrained on a base case of a comparable work process. Pixel-level visual input is leveraged to cover different numbers of encountering ships and fix the input size for the model. This model adeptly demonstrates the feasibility of transferring essential features to newer work process scenarios. Further, to enhance realism and applicability, we introduce a simplified yet comprehensive ship dynamic model that considers the substantial effects of ship inertia, thereby refining the interaction between the model and its environment. The response time is embedded into the reward function design to be considered for policy training. Experimental outcomes underscore the transferability of diverse process features and evaluate the relative effectiveness of the employed transfer methods across different task settings, offering insights that could be extrapolated to other engineering work processes.
publisher	The American Society of Mechanical Engineers (ASME)
title	Enhancing Efficiency in Collision Avoidance: A Study on Transfer Reinforcement Learning in Autonomous Ships’ Navigation
type	Journal Paper
journal volume	3
journal title	ASME Open Journal of Engineering
identifier doi	10.1115/1.4065831
journal fristpage	31019-1
journal lastpage	31019-13
page	13
tree	ASME Open Journal of Engineering:;2024:;volume( 003 ):;issue: 00
contenttype	Fulltext

Files in this item

Name:: aoje_3_031019.pdf
Size:: 816.3Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

ASME Open Journal of Engineering

Show simple item record

YaBeSH Engineering and Technology Library

Archive

Enhancing Efficiency in Collision Avoidance: A Study on Transfer Reinforcement Learning in Autonomous Ships’ Navigation

Files in this item

This item appears in the following Collection(s)