Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm

The density-based applied spatial clustering algorithm is an algorithm based on high-density interconnected regions, which discovers class clusters of arbitrary shapes in noisy data sets and is widely used. However, it suffers from slow computation speed due to large-scale disk I/O and clustering bi...

Full description

Saved in:
Bibliographic Details
Main Author: Yanfang Zhang
Format: Article
Language:English
Published: Wiley 2023-01-01
Series:Advances in Multimedia
Online Access:http://dx.doi.org/10.1155/2023/5596605
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850186013505224704
author Yanfang Zhang
author_facet Yanfang Zhang
author_sort Yanfang Zhang
collection DOAJ
description The density-based applied spatial clustering algorithm is an algorithm based on high-density interconnected regions, which discovers class clusters of arbitrary shapes in noisy data sets and is widely used. However, it suffers from slow computation speed due to large-scale disk I/O and clustering bias due to uneven density class clusters and poor parameter search ability. To address these problems, a parallel density clustering algorithm based on an improved fruit fly optimization algorithm and Spark memory iteration is proposed. The proposed algorithm first divides the data grid using an irregular dynamic density region partitioning strategy. Then, a hybrid fruit fly particle swarm algorithm based on a genetic optimization mechanism is proposed to achieve dynamic optimization seeking for parameters in local clustering to improve the clustering effect of local clustering. Finally, the local merging of samples in irregularly bounded grid cells under each partition is achieved by designing a custom clustering merging strategy. The experiments show that the improved algorithm is generally applicable to the clustering of different shape class clusters and larger scale data and has obvious improvement in accuracy and parallel efficiency.
format Article
id doaj-art-405d8a2353d44b63af1e67863d4ec3bb
institution OA Journals
issn 1687-5699
language English
publishDate 2023-01-01
publisher Wiley
record_format Article
series Advances in Multimedia
spelling doaj-art-405d8a2353d44b63af1e67863d4ec3bb2025-08-20T02:16:30ZengWileyAdvances in Multimedia1687-56992023-01-01202310.1155/2023/5596605Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering AlgorithmYanfang Zhang0Jiaozuo Normal CollegeThe density-based applied spatial clustering algorithm is an algorithm based on high-density interconnected regions, which discovers class clusters of arbitrary shapes in noisy data sets and is widely used. However, it suffers from slow computation speed due to large-scale disk I/O and clustering bias due to uneven density class clusters and poor parameter search ability. To address these problems, a parallel density clustering algorithm based on an improved fruit fly optimization algorithm and Spark memory iteration is proposed. The proposed algorithm first divides the data grid using an irregular dynamic density region partitioning strategy. Then, a hybrid fruit fly particle swarm algorithm based on a genetic optimization mechanism is proposed to achieve dynamic optimization seeking for parameters in local clustering to improve the clustering effect of local clustering. Finally, the local merging of samples in irregularly bounded grid cells under each partition is achieved by designing a custom clustering merging strategy. The experiments show that the improved algorithm is generally applicable to the clustering of different shape class clusters and larger scale data and has obvious improvement in accuracy and parallel efficiency.http://dx.doi.org/10.1155/2023/5596605
spellingShingle Yanfang Zhang
Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm
Advances in Multimedia
title Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm
title_full Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm
title_fullStr Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm
title_full_unstemmed Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm
title_short Large Data Oriented to Image Information Fusion Spark and Improved Fruit Fly Optimization Based on the Density Clustering Algorithm
title_sort large data oriented to image information fusion spark and improved fruit fly optimization based on the density clustering algorithm
url http://dx.doi.org/10.1155/2023/5596605
work_keys_str_mv AT yanfangzhang largedataorientedtoimageinformationfusionsparkandimprovedfruitflyoptimizationbasedonthedensityclusteringalgorithm