Comparative analysis of neural network models performance on low-power devices for a real-time object detection task

A computer vision based real-time object detection on low-power devices is economically attractive, yet a technically challenging task. The paper presents results of benchmarks on popular deep neural network models, which are often used for this task. The results of experiments provide insights into...

Full description

Saved in:
Bibliographic Details
Main Authors: A. Zagitov, E. Chebotareva, A. Toschev, E. Magid
Format: Article
Language:English
Published: Samara National Research University 2024-04-01
Series:Компьютерная оптика
Subjects:
Online Access:https://www.computeroptics.ru/eng/KO/Annot/KO48-2/480211e.html
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540917870886912
author A. Zagitov
E. Chebotareva
A. Toschev
E. Magid
author_facet A. Zagitov
E. Chebotareva
A. Toschev
E. Magid
author_sort A. Zagitov
collection DOAJ
description A computer vision based real-time object detection on low-power devices is economically attractive, yet a technically challenging task. The paper presents results of benchmarks on popular deep neural network models, which are often used for this task. The results of experiments provide insights into trade-offs between accuracy, speed, and computational efficiency of MobileNetV2 SSD, CenterNet MobileNetV2 FPN, EfficientDet, YoloV5, YoloV7, YoloV7 Tiny and YoloV8 neural network models on Raspberry Pi 4B, Raspberry Pi 3B and NVIDIA Jetson Nano with TensorFlow Lite. We fine-tuned the models on our custom dataset prior to benchmarking and used post-training quantization (PTQ) and quantization-aware training (QAT) to optimize the models’ size and speed. The experiments demonstrated that an appropriate algorithm selection depends on task requirements. We recommend EfficientDet Lite 512×512 quantized or YoloV7 Tiny for tasks that require around 2 FPS, EfficientDet Lite 320×320 quantized or SSD Mobilenet V2 320×320 for tasks with over 10 FPS, and EfficientDet Lite 320×320 or YoloV5 320×320 with QAT for tasks with intermediate FPS requirements.
format Article
id doaj-art-c6f4cbee7d484381910b3ec57d741bb0
institution Kabale University
issn 0134-2452
2412-6179
language English
publishDate 2024-04-01
publisher Samara National Research University
record_format Article
series Компьютерная оптика
spelling doaj-art-c6f4cbee7d484381910b3ec57d741bb02025-02-04T12:44:09ZengSamara National Research UniversityКомпьютерная оптика0134-24522412-61792024-04-0148224225210.18287/2412-6179-CO-1343Comparative analysis of neural network models performance on low-power devices for a real-time object detection taskA. Zagitov0E. Chebotareva1A. Toschev2E. Magid3Institute of Information Technology and Intelligent Systems, Kazan Federal UniversityInstitute of Information Technology and Intelligent Systems, Kazan Federal UniversityInstitute of Information Technology and Intelligent Systems, Kazan Federal UniversityInstitute of Information Technology and Intelligent Systems, Kazan Federal University; School of Electronic Engineering, Tikhonov Moscow Institute of Electronics and Mathematics, HSE UniversityA computer vision based real-time object detection on low-power devices is economically attractive, yet a technically challenging task. The paper presents results of benchmarks on popular deep neural network models, which are often used for this task. The results of experiments provide insights into trade-offs between accuracy, speed, and computational efficiency of MobileNetV2 SSD, CenterNet MobileNetV2 FPN, EfficientDet, YoloV5, YoloV7, YoloV7 Tiny and YoloV8 neural network models on Raspberry Pi 4B, Raspberry Pi 3B and NVIDIA Jetson Nano with TensorFlow Lite. We fine-tuned the models on our custom dataset prior to benchmarking and used post-training quantization (PTQ) and quantization-aware training (QAT) to optimize the models’ size and speed. The experiments demonstrated that an appropriate algorithm selection depends on task requirements. We recommend EfficientDet Lite 512×512 quantized or YoloV7 Tiny for tasks that require around 2 FPS, EfficientDet Lite 320×320 quantized or SSD Mobilenet V2 320×320 for tasks with over 10 FPS, and EfficientDet Lite 320×320 or YoloV5 320×320 with QAT for tasks with intermediate FPS requirements.https://www.computeroptics.ru/eng/KO/Annot/KO48-2/480211e.htmlcomputer visionimage analysisobject detectiondeep learningbenchmarkingoptimization techniques
spellingShingle A. Zagitov
E. Chebotareva
A. Toschev
E. Magid
Comparative analysis of neural network models performance on low-power devices for a real-time object detection task
Компьютерная оптика
computer vision
image analysis
object detection
deep learning
benchmarking
optimization techniques
title Comparative analysis of neural network models performance on low-power devices for a real-time object detection task
title_full Comparative analysis of neural network models performance on low-power devices for a real-time object detection task
title_fullStr Comparative analysis of neural network models performance on low-power devices for a real-time object detection task
title_full_unstemmed Comparative analysis of neural network models performance on low-power devices for a real-time object detection task
title_short Comparative analysis of neural network models performance on low-power devices for a real-time object detection task
title_sort comparative analysis of neural network models performance on low power devices for a real time object detection task
topic computer vision
image analysis
object detection
deep learning
benchmarking
optimization techniques
url https://www.computeroptics.ru/eng/KO/Annot/KO48-2/480211e.html
work_keys_str_mv AT azagitov comparativeanalysisofneuralnetworkmodelsperformanceonlowpowerdevicesforarealtimeobjectdetectiontask
AT echebotareva comparativeanalysisofneuralnetworkmodelsperformanceonlowpowerdevicesforarealtimeobjectdetectiontask
AT atoschev comparativeanalysisofneuralnetworkmodelsperformanceonlowpowerdevicesforarealtimeobjectdetectiontask
AT emagid comparativeanalysisofneuralnetworkmodelsperformanceonlowpowerdevicesforarealtimeobjectdetectiontask