CAMPuS: Concurrent Acceleration of Memory Access and Parallel Processing in Near-Memory SpMV Architecture
Sparse matrix-vector multiplication (SpMV) is one of the key computations in many deep-learning networks. However, the performance to compute SpMV is often limited by the DRAM bandwidth. To resolve such DRAM bandwidth issues, several studies have proposed DIMM-based near-memory processing (NMP) arch...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2024-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10807171/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Sparse matrix-vector multiplication (SpMV) is one of the key computations in many deep-learning networks. However, the performance to compute SpMV is often limited by the DRAM bandwidth. To resolve such DRAM bandwidth issues, several studies have proposed DIMM-based near-memory processing (NMP) architectures to accelerate SpMV computation. However, these studies face limitations in performance improvements owing to computational overheads in NMP cores, memory access overheads, or both. In this paper, we propose a novel method called CAMPuS to accelerate SpMV operations on a DIMM-based NMP platform. CAMPuS is designed to enhance computational parallelism, thereby boosting computational performance while maintaining high memory access efficiency. One of the key ideas is dual storage management: one for the input vector and the other for the hot-spot vector, which stores a frequently-accessed part of the input vector. The paper also presents a hot-spot-aware remapping technique for the sparse matrix to optimally use the resources of the hot-spot vector. The results of simulations conducted indicate that CAMPuS significantly outperforms one of the state-of-the-art works called Fafnir by up to <inline-formula> <tex-math notation="LaTeX">$3.23\times $ </tex-math></inline-formula>. We design a Verilog-based NMP Execution system for an end-to-end evaluation, and the end-to-end evaluation of CAMPuS with the NMP Execution system confirms as <inline-formula> <tex-math notation="LaTeX">$4.49\times $ </tex-math></inline-formula> improvement over a software SpMV implementation using the OpenMP libraries, demonstrating the effectiveness of CAMPuS in the real-world computer system. |
|---|---|
| ISSN: | 2169-3536 |