Based on linear systolic array for convolutional neural network’s calculation optimization and performance analysis

Bibliographic Details
Main Authors:	Qinrang LIU, Chongyang LIU, Jun ZHOU, Xiaolong WANG
Format:	Article
Language:	English
Published:	POSTS&TELECOM PRESS Co., LTD 2018-12-01
Series:	网络与信息安全学报 (Chinese Journal of Network and Information Security)
Subjects:	linear systolic array; convolutional neural network; sparsity; I/O bandwidth; performance analysis
Online Access:	http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2018100

Description
Summary:	Most FPGA-based convolutional neural network (CNN) accelerator designs fail to exploit sparsity effectively while also accounting for bandwidth and energy consumption. To address this, two improved CNN computation optimization strategies based on a linear systolic array architecture are proposed. First, convolution is transformed into matrix multiplication to take advantage of sparsity. Second, a linear systolic array is used to improve the design and reduce the large I/O demand of the traditional parallel matrix multiplier. Finally, a comparative analysis of CNN acceleration is presented, covering the advantages and disadvantages of the parallel matrix multiplier and the two improved linear systolic arrays. Theoretical proof and analysis show that, compared with the parallel matrix multiplier, the two improved linear systolic arrays make full use of sparsity while consuming less energy and occupying less I/O bandwidth.
ISSN:	2096-109X
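
The first step named in the summary, rewriting convolution as matrix multiplication so that sparsity can be exploited, can be illustrated with a minimal sketch. This is not the authors' FPGA design: the function names (im2col, sparse_matvec, direct_conv) are hypothetical, and the sketch assumes a single channel, stride 1, no padding, and sparsity in the kernel weights (the summary does not say which operand is sparse).

```python
def im2col(image, k):
    """Unroll every k x k window of a 2-D image (stride 1, no padding) into one row."""
    h, w = len(image), len(image[0])
    rows = []
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            rows.append([image[i + di][j + dj] for di in range(k) for dj in range(k)])
    return rows


def sparse_matvec(rows, weights):
    """Multiply the unrolled matrix by the flattened kernel, skipping zero weights.

    Returns the outputs and the number of multiply-accumulates actually performed.
    """
    nonzero = [(idx, wt) for idx, wt in enumerate(weights) if wt != 0]
    outputs = [sum(row[idx] * wt for idx, wt in nonzero) for row in rows]
    return outputs, len(nonzero) * len(rows)


def direct_conv(image, kernel):
    """Reference sliding-window convolution (cross-correlation, as used in CNNs)."""
    k = len(kernel)
    h, w = len(image), len(image[0])
    return [
        sum(image[i + di][j + dj] * kernel[di][dj] for di in range(k) for dj in range(k))
        for i in range(h - k + 1)
        for j in range(w - k + 1)
    ]


# Assumed toy data: a 4x4 input and a 3x3 kernel with only three non-zero weights.
image = [[1, 2, 3, 0],
         [4, 5, 6, 1],
         [7, 8, 9, 2],
         [0, 1, 2, 3]]
kernel = [[1, 0, -1],
          [0, 0, 0],
          [2, 0, 0]]

flat_kernel = [wt for row in kernel for wt in row]
result, macs = sparse_matvec(im2col(image, 3), flat_kernel)
dense_macs = len(flat_kernel) * len(im2col(image, 3))
print(result, macs, dense_macs)
```

In this toy case the matrix form gives the same outputs as the sliding-window reference while performing 12 multiply-accumulates instead of the 36 a dense multiplier would issue. How such savings map onto a linear systolic array is the subject of the article itself.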


Similar Items