dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures

In streaming systems with a discrete CPU-GPU architecture, leveraging the strengths of both processing units can significantly improve query performance. Existing studies assign entire queries to either the CPU or GPU to reduce data transfer overhead and boost throughput. However, this coarse-graine...

Full description

Saved in:
Bibliographic Details
Main Authors: Gyeonghwan Jung, Yeonwoo Jeong, Kyuli Park, Dongjae Lee, Hongsu Byun, Suyeon Lee, Sungyong Park
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10776952/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841536207695118336
author Gyeonghwan Jung
Yeonwoo Jeong
Kyuli Park
Dongjae Lee
Hongsu Byun
Suyeon Lee
Sungyong Park
author_facet Gyeonghwan Jung
Yeonwoo Jeong
Kyuli Park
Dongjae Lee
Hongsu Byun
Suyeon Lee
Sungyong Park
author_sort Gyeonghwan Jung
collection DOAJ
description In streaming systems with a discrete CPU-GPU architecture, leveraging the strengths of both processing units can significantly improve query performance. Existing studies assign entire queries to either the CPU or GPU to reduce data transfer overhead and boost throughput. However, this coarse-grained approach can limit performance for two main reasons. Firstly, PCIe transfer overhead is minimal for small data sizes, and the device preference of each operator within a query may change with variations in data size. Secondly, it neglects performance fluctuations based on the placement location of consecutive operators within the device. To address these issues, we propose dSTREAM, a distributed stream processing system that dynamically maps queries at the operator level on discrete CPU-GPU architectures. dSTREAM adapts to runtime conditions by selecting the optimal device for each operator dynamically. Through dynamic operator-level query mapping without prior knowledge, dSTREAM consistently achieves high performance. Extensive evaluation has confirmed that dSTREAM enhances average throughput by up to 45% and reduces average latency by up to 42.5% across various types of stream SQL queries, regardless of traffic types.
format Article
id doaj-art-cb4cbe0129c64d1c9ab2237da2254178
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-cb4cbe0129c64d1c9ab2237da22541782025-01-15T00:02:55ZengIEEEIEEE Access2169-35362025-01-01138239825610.1109/ACCESS.2024.351088510776952dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU ArchitecturesGyeonghwan Jung0https://orcid.org/0009-0002-9393-3936Yeonwoo Jeong1https://orcid.org/0000-0003-4653-115XKyuli Park2https://orcid.org/0009-0001-2404-3430Dongjae Lee3https://orcid.org/0009-0005-8487-9888Hongsu Byun4Suyeon Lee5https://orcid.org/0000-0002-5526-6127Sungyong Park6https://orcid.org/0000-0002-0309-1820Department of Computer Science and Engineering, Sogang University, Seoul, South KoreaDepartment of Computer Science and Engineering, Sogang University, Seoul, South KoreaDepartment of Computer Science and Engineering, Sogang University, Seoul, South KoreaDepartment of Computer Science and Engineering, Sogang University, Seoul, South KoreaDepartment of Computer Science and Engineering, Sogang University, Seoul, South KoreaDepartment of Computer Science, Georgia Institute of Technology, Atlanta, GA, USADepartment of Computer Science and Engineering, Sogang University, Seoul, South KoreaIn streaming systems with a discrete CPU-GPU architecture, leveraging the strengths of both processing units can significantly improve query performance. Existing studies assign entire queries to either the CPU or GPU to reduce data transfer overhead and boost throughput. However, this coarse-grained approach can limit performance for two main reasons. Firstly, PCIe transfer overhead is minimal for small data sizes, and the device preference of each operator within a query may change with variations in data size. Secondly, it neglects performance fluctuations based on the placement location of consecutive operators within the device. To address these issues, we propose dSTREAM, a distributed stream processing system that dynamically maps queries at the operator level on discrete CPU-GPU architectures. dSTREAM adapts to runtime conditions by selecting the optimal device for each operator dynamically. Through dynamic operator-level query mapping without prior knowledge, dSTREAM consistently achieves high performance. Extensive evaluation has confirmed that dSTREAM enhances average throughput by up to 45% and reduces average latency by up to 42.5% across various types of stream SQL queries, regardless of traffic types.https://ieeexplore.ieee.org/document/10776952/Query planningdata stream processingheterogeneous architectures
spellingShingle Gyeonghwan Jung
Yeonwoo Jeong
Kyuli Park
Dongjae Lee
Hongsu Byun
Suyeon Lee
Sungyong Park
dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures
IEEE Access
Query planning
data stream processing
heterogeneous architectures
title dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures
title_full dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures
title_fullStr dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures
title_full_unstemmed dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures
title_short dStream: An Online-Based Dynamic Operator-Level Query Mapping Scheme on Discrete CPU-GPU Architectures
title_sort dstream an online based dynamic operator level query mapping scheme on discrete cpu gpu architectures
topic Query planning
data stream processing
heterogeneous architectures
url https://ieeexplore.ieee.org/document/10776952/
work_keys_str_mv AT gyeonghwanjung dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures
AT yeonwoojeong dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures
AT kyulipark dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures
AT dongjaelee dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures
AT hongsubyun dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures
AT suyeonlee dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures
AT sungyongpark dstreamanonlinebaseddynamicoperatorlevelquerymappingschemeondiscretecpugpuarchitectures