Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit

Comprehensive learning particle swarm optimization (CLPSO) is a powerful metaheuristic for global optimization. This paper studies parallelizing CLPSO by open computing language (OpenCL) on the integrated Intel HD Graphics 520 (IHDG520) graphical processing unit (GPU) with a low clock rate. We imple...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiang Yu, Yu Qiao, Qingpeng Li, Gang Xu, Chuanxiong Kang, Claudio Estevez, Chengzhi Deng, Shengqian Wang
Format: Article
Language:English
Published: Wiley 2020-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2020/6589658
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849469471415074816
author Xiang Yu
Yu Qiao
Qingpeng Li
Gang Xu
Chuanxiong Kang
Claudio Estevez
Chengzhi Deng
Shengqian Wang
author_facet Xiang Yu
Yu Qiao
Qingpeng Li
Gang Xu
Chuanxiong Kang
Claudio Estevez
Chengzhi Deng
Shengqian Wang
author_sort Xiang Yu
collection DOAJ
description Comprehensive learning particle swarm optimization (CLPSO) is a powerful metaheuristic for global optimization. This paper studies parallelizing CLPSO by open computing language (OpenCL) on the integrated Intel HD Graphics 520 (IHDG520) graphical processing unit (GPU) with a low clock rate. We implement a coarse-grained all-GPU model that maps each particle to a separate work item. Two enhancement strategies, namely, generating and transferring random numbers from the central processor to the GPU as well as reducing the number of instructions in the kernel, are proposed to shorten the model’s execution time. This paper further investigates parallelizing deterministic optimization for implicit stochastic optimization of China’s Xiaowan Reservoir. The deterministic optimization is performed on an ensemble of 62 years’ historical inflow records with monthly time steps, is solved by CLPSO, and is parallelized by a coarse-grained multipopulation model extended from the all-GPU model. The multipopulation model involves a large number of work items. Because of the capacity limit for a buffer transferring data from the central processor to the GPU and the size of the global memory region, the random number generation strategy is modified by generating a small number of random numbers that can be flexibly exploited by the large number of work items. Experiments conducted on various benchmark functions and the case study demonstrate that our proposed all-GPU and multipopulation parallelization models are appropriate; and the multipopulation model achieves the consumption of significantly less execution time than the corresponding sequential model.
format Article
id doaj-art-fd9042eb70024ff6bc41e40ab256535e
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2020-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-fd9042eb70024ff6bc41e40ab256535e2025-08-20T03:25:27ZengWileyComplexity1076-27871099-05262020-01-01202010.1155/2020/65896586589658Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing UnitXiang Yu0Yu Qiao1Qingpeng Li2Gang Xu3Chuanxiong Kang4Claudio Estevez5Chengzhi Deng6Shengqian Wang7Provincial Key Laboratory for Water Information Cooperative Sensing and Intelligent Processing, Nanchang Institute of Technology, Nanchang, Jiangxi 330099, ChinaSchool of Mathematics and Information Science, Shaanxi Normal University, Xi’an, Shaanxi 710119, ChinaState Grid Nanchang Electric Power Supply Company, Nanchang, Jiangxi 330069, ChinaDepartment of Mathematics, Nanchang University, Nanchang, Jiangxi 330031, ChinaProvincial Key Laboratory for Water Information Cooperative Sensing and Intelligent Processing, Nanchang Institute of Technology, Nanchang, Jiangxi 330099, ChinaDepartment of Electrical Engineering, University of Chile, Santiago 8370451, ChileProvincial Key Laboratory for Water Information Cooperative Sensing and Intelligent Processing, Nanchang Institute of Technology, Nanchang, Jiangxi 330099, ChinaProvincial Key Laboratory for Water Information Cooperative Sensing and Intelligent Processing, Nanchang Institute of Technology, Nanchang, Jiangxi 330099, ChinaComprehensive learning particle swarm optimization (CLPSO) is a powerful metaheuristic for global optimization. This paper studies parallelizing CLPSO by open computing language (OpenCL) on the integrated Intel HD Graphics 520 (IHDG520) graphical processing unit (GPU) with a low clock rate. We implement a coarse-grained all-GPU model that maps each particle to a separate work item. Two enhancement strategies, namely, generating and transferring random numbers from the central processor to the GPU as well as reducing the number of instructions in the kernel, are proposed to shorten the model’s execution time. This paper further investigates parallelizing deterministic optimization for implicit stochastic optimization of China’s Xiaowan Reservoir. The deterministic optimization is performed on an ensemble of 62 years’ historical inflow records with monthly time steps, is solved by CLPSO, and is parallelized by a coarse-grained multipopulation model extended from the all-GPU model. The multipopulation model involves a large number of work items. Because of the capacity limit for a buffer transferring data from the central processor to the GPU and the size of the global memory region, the random number generation strategy is modified by generating a small number of random numbers that can be flexibly exploited by the large number of work items. Experiments conducted on various benchmark functions and the case study demonstrate that our proposed all-GPU and multipopulation parallelization models are appropriate; and the multipopulation model achieves the consumption of significantly less execution time than the corresponding sequential model.http://dx.doi.org/10.1155/2020/6589658
spellingShingle Xiang Yu
Yu Qiao
Qingpeng Li
Gang Xu
Chuanxiong Kang
Claudio Estevez
Chengzhi Deng
Shengqian Wang
Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit
Complexity
title Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit
title_full Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit
title_fullStr Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit
title_full_unstemmed Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit
title_short Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit
title_sort parallelizing comprehensive learning particle swarm optimization by open computing language on an integrated graphical processing unit
url http://dx.doi.org/10.1155/2020/6589658
work_keys_str_mv AT xiangyu parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT yuqiao parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT qingpengli parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT gangxu parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT chuanxiongkang parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT claudioestevez parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT chengzhideng parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit
AT shengqianwang parallelizingcomprehensivelearningparticleswarmoptimizationbyopencomputinglanguageonanintegratedgraphicalprocessingunit