Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey

Semantic segmentation is an important computer vision task due to its numerous real-world applications such as autonomous driving, video surveillance, medical image analysis, robotics, augmented reality, among others, and its popularity increased with the development of deep learning approaches. We...

Full description

Saved in:

Bibliographic Details
Main Authors:	Cristina Carunta, Alina Carunta, Calin-Adrian Popa
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Complex deep learning models convolutional neural networks real-time models semantic segmentation Transformer model
Online Access:	https://ieeexplore.ieee.org/document/10840225/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832576775048134656
author	Cristina Carunta Alina Carunta Calin-Adrian Popa
author_facet	Cristina Carunta Alina Carunta Calin-Adrian Popa
author_sort	Cristina Carunta
collection	DOAJ
description	Semantic segmentation is an important computer vision task due to its numerous real-world applications such as autonomous driving, video surveillance, medical image analysis, robotics, augmented reality, among others, and its popularity increased with the development of deep learning approaches. We provide a detailed review comprising the most significant methods for both heavy and lightweight two-dimensional (2D) semantic segmentation, starting with the introduction of convolutional neural networks until the use of Transformer architecture, the latter being a widely adopted model with state-of-the-art results in several artificial intelligence fields. The methods involved are described from the architectural design perspective, including encoder-decoder architectures, multi-resolution branches approaches, two-pathway encoder architectures, attention-based models, and pyramid-based models. Additionally, some of the most popular datasets and performance metrics are presented. Further, we investigate the limitations of these methods, compare their performance on Pascal VOC 2012, Cityscapes, and ADE20K datasets, and finally indicate future research directions.
format	Article
id	doaj-art-a407120564774f3eb83c7cdf5cba036f
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-a407120564774f3eb83c7cdf5cba036f2025-01-31T00:01:31ZengIEEEIEEE Access2169-35362025-01-0113177451776510.1109/ACCESS.2025.352981210840225Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A SurveyCristina Carunta0https://orcid.org/0009-0003-7597-0781Alina Carunta1https://orcid.org/0009-0005-0036-5418Calin-Adrian Popa2https://orcid.org/0000-0003-4445-8091Department of Computers and Information Technology, Politehnica University of Timişoara, Timişoara, RomaniaDepartment of Computer Science, West University of Timişoara, Timişoara, RomaniaDepartment of Computers and Information Technology, Politehnica University of Timişoara, Timişoara, RomaniaSemantic segmentation is an important computer vision task due to its numerous real-world applications such as autonomous driving, video surveillance, medical image analysis, robotics, augmented reality, among others, and its popularity increased with the development of deep learning approaches. We provide a detailed review comprising the most significant methods for both heavy and lightweight two-dimensional (2D) semantic segmentation, starting with the introduction of convolutional neural networks until the use of Transformer architecture, the latter being a widely adopted model with state-of-the-art results in several artificial intelligence fields. The methods involved are described from the architectural design perspective, including encoder-decoder architectures, multi-resolution branches approaches, two-pathway encoder architectures, attention-based models, and pyramid-based models. Additionally, some of the most popular datasets and performance metrics are presented. Further, we investigate the limitations of these methods, compare their performance on Pascal VOC 2012, Cityscapes, and ADE20K datasets, and finally indicate future research directions.https://ieeexplore.ieee.org/document/10840225/Complex deep learning modelsconvolutional neural networksreal-time modelssemantic segmentationTransformer model
spellingShingle	Cristina Carunta Alina Carunta Calin-Adrian Popa Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey IEEE Access Complex deep learning models convolutional neural networks real-time models semantic segmentation Transformer model
title	Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey
title_full	Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey
title_fullStr	Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey
title_full_unstemmed	Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey
title_short	Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey
title_sort	heavy and lightweight deep learning models for semantic segmentation a survey
topic	Complex deep learning models convolutional neural networks real-time models semantic segmentation Transformer model
url	https://ieeexplore.ieee.org/document/10840225/
work_keys_str_mv	AT cristinacarunta heavyandlightweightdeeplearningmodelsforsemanticsegmentationasurvey AT alinacarunta heavyandlightweightdeeplearningmodelsforsemanticsegmentationasurvey AT calinadrianpopa heavyandlightweightdeeplearningmodelsforsemanticsegmentationasurvey

Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey

Similar Items