Deep Learning Scheduling on a Field-Programmable Gate Array Cluster Using Configurable Deep Learning Accelerators

This paper presents the development and evaluation of a distributed system employing low-latency embedded field-programmable gate arrays (FPGAs) to optimize scheduling for deep learning (DL) workloads and to configure multiple deep learning accelerator (DLA) architectures. Aimed at advancing FPGA ap...

Full description

Saved in:
Bibliographic Details
Main Authors: Tianyang Fang, Alejandro Perez-Vicente, Hans Johnson, Jafar Saniie
Format: Article
Language:English
Published: MDPI AG 2025-04-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/16/4/298
Tags: Add Tag
No Tags, Be the first to tag this record!