Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image

Introduction. An important part of an automotive unmanned vehicle (UV) control system is the environment analysis module. This module is based on various types of sensors, e.g. video cameras, lidars and radars. The development of computer and video technologies makes it possible to implement an envi...

Full description

Saved in:
Bibliographic Details
Main Authors: I. G. Zubov, N. A. Obukhova
Format: Article
Language:Russian
Published: Saint Petersburg Electrotechnical University "LETI" 2021-06-01
Series:Известия высших учебных заведений России: Радиоэлектроника
Subjects:
Online Access:https://re.eltech.ru/jour/article/view/521
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849696197118263296
author I. G. Zubov
N. A. Obukhova
author_facet I. G. Zubov
N. A. Obukhova
author_sort I. G. Zubov
collection DOAJ
description Introduction. An important part of an automotive unmanned vehicle (UV) control system is the environment analysis module. This module is based on various types of sensors, e.g. video cameras, lidars and radars. The development of computer and video technologies makes it possible to implement an environment analysis module using a single video camera as a sensor. This approach is expected to reduce the cost of the entire module. The main task in video image processing is to analyse the environment as a 3D scene. The 3D trajectory of an object, which takes into account its dimensions, angle of view and movement vector, as well as the vehicle pose in a video image, provides sufficient information for assessing the real interaction of objects. A basis for constructing a 3D trajectory is vehicle pose estimation.Aim. To develop an automatic method for estimating vehicle pose based on video data analysis from a single video camera.Materials and methods. An automatic method for vehicle pose estimation from a video image was proposed based on a cascade approach. The method includes vehicle detection, key points determination, segmentation and vehicle pose estimation. Vehicle detection and determination of its key points were resolved via a neural network. The segmentation of a vehicle video image and its mask preparation were implemented by transforming it into a polar coordinate system and searching for the outer contour using graph theory.Results. The estimation of vehicle pose was implemented by matching the Fourier image of vehicle mask signatures and the templates obtained based on 3D models. The correctness of the obtained vehicle pose and angle of view estimation was confirmed by experiments based on the proposed method. The vehicle pose estimation had an accuracy of 89 % on an open Carvana image dataset.Conclusion. A new approach for vehicle pose estimation was proposed, involving the transition from end-to-end learning of neural networks to resolve several problems at once, e.g., localization, classification, segmentation, and angle of view, towards cascade analysis of information. The accuracy level of end-to-end learning requires large sets of representative data, which complicates the scalability of solutions for road environments in Russia. The proposed method makes it possible to estimate the vehicle pose with a high accuracy level, at the same time as involving no large costs for manual data annotation and training.
format Article
id doaj-art-df2e05333bbe45aaad0527fa5ade41a4
institution DOAJ
issn 1993-8985
2658-4794
language Russian
publishDate 2021-06-01
publisher Saint Petersburg Electrotechnical University "LETI"
record_format Article
series Известия высших учебных заведений России: Радиоэлектроника
spelling doaj-art-df2e05333bbe45aaad0527fa5ade41a42025-08-20T03:19:32ZrusSaint Petersburg Electrotechnical University "LETI"Известия высших учебных заведений России: Радиоэлектроника1993-89852658-47942021-06-01243495910.32603/1993-8985-2021-24-3-49-59390Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video ImageI. G. Zubov0N. A. Obukhova1Ltd "Next"Saint Petersburg Electrotechnical UniversityIntroduction. An important part of an automotive unmanned vehicle (UV) control system is the environment analysis module. This module is based on various types of sensors, e.g. video cameras, lidars and radars. The development of computer and video technologies makes it possible to implement an environment analysis module using a single video camera as a sensor. This approach is expected to reduce the cost of the entire module. The main task in video image processing is to analyse the environment as a 3D scene. The 3D trajectory of an object, which takes into account its dimensions, angle of view and movement vector, as well as the vehicle pose in a video image, provides sufficient information for assessing the real interaction of objects. A basis for constructing a 3D trajectory is vehicle pose estimation.Aim. To develop an automatic method for estimating vehicle pose based on video data analysis from a single video camera.Materials and methods. An automatic method for vehicle pose estimation from a video image was proposed based on a cascade approach. The method includes vehicle detection, key points determination, segmentation and vehicle pose estimation. Vehicle detection and determination of its key points were resolved via a neural network. The segmentation of a vehicle video image and its mask preparation were implemented by transforming it into a polar coordinate system and searching for the outer contour using graph theory.Results. The estimation of vehicle pose was implemented by matching the Fourier image of vehicle mask signatures and the templates obtained based on 3D models. The correctness of the obtained vehicle pose and angle of view estimation was confirmed by experiments based on the proposed method. The vehicle pose estimation had an accuracy of 89 % on an open Carvana image dataset.Conclusion. A new approach for vehicle pose estimation was proposed, involving the transition from end-to-end learning of neural networks to resolve several problems at once, e.g., localization, classification, segmentation, and angle of view, towards cascade analysis of information. The accuracy level of end-to-end learning requires large sets of representative data, which complicates the scalability of solutions for road environments in Russia. The proposed method makes it possible to estimate the vehicle pose with a high accuracy level, at the same time as involving no large costs for manual data annotation and training.https://re.eltech.ru/jour/article/view/521convolutional neural networksanalysis of activation mapsdetection of key pointsimage segmentationpattern matching
spellingShingle I. G. Zubov
N. A. Obukhova
Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image
Известия высших учебных заведений России: Радиоэлектроника
convolutional neural networks
analysis of activation maps
detection of key points
image segmentation
pattern matching
title Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image
title_full Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image
title_fullStr Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image
title_full_unstemmed Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image
title_short Method for Automatic Determination of a 3D Trajectory of Vehicles in a Video Image
title_sort method for automatic determination of a 3d trajectory of vehicles in a video image
topic convolutional neural networks
analysis of activation maps
detection of key points
image segmentation
pattern matching
url https://re.eltech.ru/jour/article/view/521
work_keys_str_mv AT igzubov methodforautomaticdeterminationofa3dtrajectoryofvehiclesinavideoimage
AT naobukhova methodforautomaticdeterminationofa3dtrajectoryofvehiclesinavideoimage