Improving person re-identification based on two-stage training of convolutional neural networks and augmentation

Objectives. The main goal is to improve person re-identification accuracy in distributed video surveillance systems.Methods. Machine learning methods are applied.Result. A technology for two-stage training of convolutional neural networks (CNN) is presented, characterized by the use of image augment...

Full description

Saved in:
Bibliographic Details
Main Authors: S. A. Ihnatsyeva, R. P. Bohush
Format: Article
Language:Russian
Published: National Academy of Sciences of Belarus, the United Institute of Informatics Problems 2023-03-01
Series:Informatika
Subjects:
Online Access:https://inf.grid.by/jour/article/view/1225
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832543426825945088
author S. A. Ihnatsyeva
R. P. Bohush
author_facet S. A. Ihnatsyeva
R. P. Bohush
author_sort S. A. Ihnatsyeva
collection DOAJ
description Objectives. The main goal is to improve person re-identification accuracy in distributed video surveillance systems.Methods. Machine learning methods are applied.Result. A technology for two-stage training of convolutional neural networks (CNN) is presented, characterized by the use of image augmentation for the preliminary stage and fine tuning of weight coefficients based on the original images set for training. At the first stage, training is carried out on augmented data, at the second stage, fine tuning of the CNN is performed on the original images, which allows minimizing the losses and increasing model efficiency. The use of different data at different training stages does not allow the CNN to remember training examples, thereby preventing overfitting.Proposed method as expanding the training sample differs as it combines an image pixels cyclic shift, color  exclusion and fragment replacement with a reduced copy of another image. This augmentation method allows to get a wide variety of training data, which increases the CNN robustness to occlusions, illumination, low image resolution, dependence on the location of features.Conclusion. The use of two-stage learning technology and the proposed data augmentation method made it possible to increase the person re-identification accuracy for different CNNs and datasets: in the Rank1 metric  by 4–21 %; in the mAP by 10–31 %; in the mINP by 39–60 %.
format Article
id doaj-art-6ab3b11bd74a4b58895c8c876286f078
institution Kabale University
issn 1816-0301
language Russian
publishDate 2023-03-01
publisher National Academy of Sciences of Belarus, the United Institute of Informatics Problems
record_format Article
series Informatika
spelling doaj-art-6ab3b11bd74a4b58895c8c876286f0782025-02-03T11:40:30ZrusNational Academy of Sciences of Belarus, the United Institute of Informatics ProblemsInformatika1816-03012023-03-01201405410.37661/1816-0301-2023-20-1-40-541022Improving person re-identification based on two-stage training of convolutional neural networks and augmentationS. A. Ihnatsyeva0R. P. Bohush1Euphrosyne Polotskaya State University of PolotskEuphrosyne Polotskaya State University of PolotskObjectives. The main goal is to improve person re-identification accuracy in distributed video surveillance systems.Methods. Machine learning methods are applied.Result. A technology for two-stage training of convolutional neural networks (CNN) is presented, characterized by the use of image augmentation for the preliminary stage and fine tuning of weight coefficients based on the original images set for training. At the first stage, training is carried out on augmented data, at the second stage, fine tuning of the CNN is performed on the original images, which allows minimizing the losses and increasing model efficiency. The use of different data at different training stages does not allow the CNN to remember training examples, thereby preventing overfitting.Proposed method as expanding the training sample differs as it combines an image pixels cyclic shift, color  exclusion and fragment replacement with a reduced copy of another image. This augmentation method allows to get a wide variety of training data, which increases the CNN robustness to occlusions, illumination, low image resolution, dependence on the location of features.Conclusion. The use of two-stage learning technology and the proposed data augmentation method made it possible to increase the person re-identification accuracy for different CNNs and datasets: in the Rank1 metric  by 4–21 %; in the mAP by 10–31 %; in the mINP by 39–60 %.https://inf.grid.by/jour/article/view/1225person re-identificationconvolutional neural networkpre-trainfine tuningaugmentation
spellingShingle S. A. Ihnatsyeva
R. P. Bohush
Improving person re-identification based on two-stage training of convolutional neural networks and augmentation
Informatika
person re-identification
convolutional neural network
pre-train
fine tuning
augmentation
title Improving person re-identification based on two-stage training of convolutional neural networks and augmentation
title_full Improving person re-identification based on two-stage training of convolutional neural networks and augmentation
title_fullStr Improving person re-identification based on two-stage training of convolutional neural networks and augmentation
title_full_unstemmed Improving person re-identification based on two-stage training of convolutional neural networks and augmentation
title_short Improving person re-identification based on two-stage training of convolutional neural networks and augmentation
title_sort improving person re identification based on two stage training of convolutional neural networks and augmentation
topic person re-identification
convolutional neural network
pre-train
fine tuning
augmentation
url https://inf.grid.by/jour/article/view/1225
work_keys_str_mv AT saihnatsyeva improvingpersonreidentificationbasedontwostagetrainingofconvolutionalneuralnetworksandaugmentation
AT rpbohush improvingpersonreidentificationbasedontwostagetrainingofconvolutionalneuralnetworksandaugmentation