CaLRS: A Critical-Aware Shared LLC Request Scheduling Algorithm on GPGPU

The ultra-high thread-level parallelism of modern GPUs usually produces numerous simultaneous memory requests, so there are always many requests waiting at each bank of the shared LLC (L2 in this paper) and of global memory. For global memory, various schedulers have already been developed...

Bibliographic Details
Main Authors: Jianliang Ma, Jinglei Meng, Tianzhou Chen, Minghui Wu
Format: Article
Language: English
Published: Wiley 2015-01-01
Series: The Scientific World Journal
Online Access: http://dx.doi.org/10.1155/2015/848416
collection DOAJ
description The ultra-high thread-level parallelism of modern GPUs usually produces numerous simultaneous memory requests, so there are always many requests waiting at each bank of the shared LLC (L2 in this paper) and of global memory. For global memory, various schedulers have already been developed to adjust the request sequence, but we find that little work has focused on the service sequence at the shared LLC. We measured that, for a large number of GPU applications, requests always queue at the LLC banks for service, which provides an opportunity to optimize the service order at the LLC. By adjusting the service order of GPU memory requests, we can improve the schedulability of the SMs. We therefore propose a critical-aware shared LLC request scheduling algorithm (CaLRS). How the priority of a memory request is represented is critical for CaLRS: we represent the criticality of each warp by the number of memory requests that originate from the same warp but have not yet been serviced when they arrive at the shared LLC bank. Experiments show that the proposed scheme effectively boosts SM schedulability by promoting the scheduling priority of memory requests with high criticality, and thereby indirectly improves GPU performance.
id doaj-art-da226da647b54108a97ab4f9395c6db7
institution Kabale University
issn 2356-6140
1537-744X
affiliation Jianliang Ma, Jinglei Meng, Tianzhou Chen: College of Computer Science, Zhejiang University, Zheda Road No. 38, Hangzhou 310013, China
Minghui Wu: Zhejiang University City College, Huzhou Road No. 51, Hangzhou 310015, China
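
The scheduling idea summarized in the description can be illustrated with a small software model. This is only a sketch under stated assumptions, not the authors' implementation: the names `Request`, `BankQueue`, `issue`, `arrive`, and `service` are hypothetical, the model covers a single LLC bank, and the abstract does not say whether a larger or a smaller count of unserviced requests counts as "high criticality", so the direction is left as the parameter `higher_count_first`.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Request:
    warp_id: int
    address: int
    arrival: int      # arrival order at this bank, used as a FIFO tie-breaker
    criticality: int  # the warp's unserviced requests, sampled on arrival


class BankQueue:
    """Toy model of one shared-LLC bank queue (hypothetical, for illustration)."""

    def __init__(self, higher_count_first=True):
        # Assumption: which direction means "more critical" is configurable.
        self.higher_count_first = higher_count_first
        self.outstanding = Counter()  # unserviced requests per warp
        self.pending = []
        self._arrivals = 0

    def issue(self, warp_id, count):
        """A warp sends `count` memory requests toward the LLC."""
        self.outstanding[warp_id] += count

    def arrive(self, warp_id, address):
        """A request reaches the bank and is tagged with its warp's criticality."""
        req = Request(warp_id, address, self._arrivals, self.outstanding[warp_id])
        self._arrivals += 1
        self.pending.append(req)
        return req

    def service(self):
        """Serve the pending request with the highest priority, or None if idle."""
        if not self.pending:
            return None
        sign = -1 if self.higher_count_first else 1
        req = min(self.pending, key=lambda r: (sign * r.criticality, r.arrival))
        self.pending.remove(req)
        self.outstanding[req.warp_id] -= 1
        return req


def run(higher_count_first):
    bank = BankQueue(higher_count_first)
    bank.issue(warp_id=0, count=3)
    bank.issue(warp_id=1, count=1)
    bank.arrive(1, 0x200)  # warp 1 arrives first, so plain FIFO would serve it first
    bank.arrive(0, 0x100)
    bank.arrive(0, 0x140)
    return [bank.service().warp_id for _ in range(3)]


order_high = run(higher_count_first=True)
order_low = run(higher_count_first=False)
print(order_high, order_low)
```

In this model the criticality tag is sampled once, on arrival, which matches the wording of the description; a hardware scheduler might instead update it as sibling requests complete.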