Lightweight Road Environment Segmentation using Vector Quantization

Road environment segmentation plays a significant role in autonomous driving. Numerous works based on Fully Convolutional Networks (FCNs) and Transformer architectures have been proposed to leverage local and global contextual learning for efficient and accurate semantic segmentation. In both archit...

Full description

Saved in:
Bibliographic Details
Main Authors: J. Kwag, A. Yilmaz, C. Toth
Format: Article
Language:English
Published: Copernicus Publications 2025-07-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://isprs-annals.copernicus.org/articles/X-G-2025/519/2025/isprs-annals-X-G-2025-519-2025.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849321046342107136
author J. Kwag
A. Yilmaz
C. Toth
author_facet J. Kwag
A. Yilmaz
C. Toth
author_sort J. Kwag
collection DOAJ
description Road environment segmentation plays a significant role in autonomous driving. Numerous works based on Fully Convolutional Networks (FCNs) and Transformer architectures have been proposed to leverage local and global contextual learning for efficient and accurate semantic segmentation. In both architectures, the encoder often relies heavily on extracting continuous representations from the image, which limits the ability to represent meaningful discrete information. To address this limitation, we propose segmentation of the autonomous driving environment using vector quantization. Vector quantization offers three primary advantages for road environment segmentation. (1) Each continuous feature from the encoder is mapped to a discrete vector from the codebook, helping the model discover distinct features more easily than with complex continuous features. (2) Since a discrete feature acts as compressed versions of the encoder’s continuous features, they also compress noise or outliers, enhancing the image segmentation task. (3) Vector quantization encourages the latent space to form coarse clusters of continuous features, forcing the model to group similar features, making the learned representations more structured for the decoding process. In this work, we combined vector quantization with the lightweight image segmentation model MobileUNETR and used it as a baseline model for comparison to demonstrate its efficiency. Through experiments, we achieved 77.0 % mIoU on Cityscapes, outperforming the baseline by 2.9 % without increasing the model’s initial size or complexity.
format Article
id doaj-art-ee40709b517346a8a44bd98aad51243a
institution Kabale University
issn 2194-9042
2194-9050
language English
publishDate 2025-07-01
publisher Copernicus Publications
record_format Article
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj-art-ee40709b517346a8a44bd98aad51243a2025-08-20T03:49:51ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502025-07-01X-G-202551952510.5194/isprs-annals-X-G-2025-519-2025Lightweight Road Environment Segmentation using Vector QuantizationJ. Kwag0A. Yilmaz1C. Toth2Department of Civil, Environmental, and Geodetic Engineering, The Ohio State University, 281 W Lane Ave, Columbus, Ohio, USADepartment of Civil, Environmental, and Geodetic Engineering, The Ohio State University, 281 W Lane Ave, Columbus, Ohio, USADepartment of Civil, Environmental, and Geodetic Engineering, The Ohio State University, 281 W Lane Ave, Columbus, Ohio, USARoad environment segmentation plays a significant role in autonomous driving. Numerous works based on Fully Convolutional Networks (FCNs) and Transformer architectures have been proposed to leverage local and global contextual learning for efficient and accurate semantic segmentation. In both architectures, the encoder often relies heavily on extracting continuous representations from the image, which limits the ability to represent meaningful discrete information. To address this limitation, we propose segmentation of the autonomous driving environment using vector quantization. Vector quantization offers three primary advantages for road environment segmentation. (1) Each continuous feature from the encoder is mapped to a discrete vector from the codebook, helping the model discover distinct features more easily than with complex continuous features. (2) Since a discrete feature acts as compressed versions of the encoder’s continuous features, they also compress noise or outliers, enhancing the image segmentation task. (3) Vector quantization encourages the latent space to form coarse clusters of continuous features, forcing the model to group similar features, making the learned representations more structured for the decoding process. In this work, we combined vector quantization with the lightweight image segmentation model MobileUNETR and used it as a baseline model for comparison to demonstrate its efficiency. Through experiments, we achieved 77.0 % mIoU on Cityscapes, outperforming the baseline by 2.9 % without increasing the model’s initial size or complexity.https://isprs-annals.copernicus.org/articles/X-G-2025/519/2025/isprs-annals-X-G-2025-519-2025.pdf
spellingShingle J. Kwag
A. Yilmaz
C. Toth
Lightweight Road Environment Segmentation using Vector Quantization
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
title Lightweight Road Environment Segmentation using Vector Quantization
title_full Lightweight Road Environment Segmentation using Vector Quantization
title_fullStr Lightweight Road Environment Segmentation using Vector Quantization
title_full_unstemmed Lightweight Road Environment Segmentation using Vector Quantization
title_short Lightweight Road Environment Segmentation using Vector Quantization
title_sort lightweight road environment segmentation using vector quantization
url https://isprs-annals.copernicus.org/articles/X-G-2025/519/2025/isprs-annals-X-G-2025-519-2025.pdf
work_keys_str_mv AT jkwag lightweightroadenvironmentsegmentationusingvectorquantization
AT ayilmaz lightweightroadenvironmentsegmentationusingvectorquantization
AT ctoth lightweightroadenvironmentsegmentationusingvectorquantization