Development of HTC-DBSCAN: A Hierarchical Trajectory Clustering Algorithm with Automated Parameter Tuning

Existing route-clustering methods often fail to identify abnormal sections or similarities between routes, mainly when working with large or long datasets. While sub-route clustering can detect regional patterns, it struggles to accurately capture the overall route structure. The present study propo...

Full description

Saved in:
Bibliographic Details
Main Authors: Dae-Han Lee, Joo-Sung Kim
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/23/10995
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Existing route-clustering methods often fail to identify abnormal sections or similarities between routes, mainly when working with large or long datasets. While sub-route clustering can detect regional patterns, it struggles to accurately capture the overall route structure. The present study proposes a new ship route-clustering method that enhances computational efficiency and noise recognition while addressing these limitations. We refined Automatic Identification System data via four data-cleaning processes and applied a statistical distance measurement to assess ship trajectory similarity. Dimensionality reduction was then used to facilitate clustering. The clustering of ship route similarities is non-parametric and can be applied to datasets not separated based on density to find clusters of various densities. Density-Based Spatial Clustering of Applications (DBSCA) applies to many research fields; using the DBSCA with Noise (DBSCAN) algorithm, we propose an improved DBSCAN algorithm that automatically determines the parameters Epsilon and MinPts. In this study, as a core ship route-clustering process, we propose a sub-route clustering process by setting the distance and density of data points to clear standards for re-analysis and completion. The proposed approach demonstrates markedly enhanced clustering performance, offering a more sophisticated and efficient basis for ship route decision-making.
ISSN:2076-3417