Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM

Abstract As urbanization continues to accelerate, there is a growing need for the analysis of urban spatial perception attributes and scene elements. In response, the research proposed a multi-scale perception network-based model for extracting urban scene elements and an attention-enhanced segmenta...

Full description

Saved in:
Bibliographic Details
Main Author: Jing Xu
Format: Article
Language:English
Published: Springer 2025-04-01
Series:Computational Urban Science
Subjects:
Online Access:https://doi.org/10.1007/s43762-025-00181-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849314589817176064
author Jing Xu
author_facet Jing Xu
author_sort Jing Xu
collection DOAJ
description Abstract As urbanization continues to accelerate, there is a growing need for the analysis of urban spatial perception attributes and scene elements. In response, the research proposed a multi-scale perception network-based model for extracting urban scene elements and an attention-enhanced segmentation network-based model for analyzing urban spatial scene structures. The urban scene feature extraction model incorporated Siamese convolutional neural networks and convolutional block attention to achieve multi-scale perception extraction. The urban spatial scene structure analysis model combined a dynamic attention module with an encoder-decoder architecture to enhance the accuracy of scene element segmentation. During testing of the urban scene feature extraction model at a resolution of 768, its classification accuracy and cross entropy were 95.4% and 0.065, respectively. The model's average ranking accuracy for beauty, comfort, and cleanliness was 92.5%, 91.8%, and 93.2%. In testing the urban spatial scene structure analysis model, the boundary intersection to union ratio and boundary F1 score were 81.2% and 82.1%, respectively, with a boundary complexity of 0.6. The results demonstrated that the proposed method excelled in tasks such as perceptual attribute classification and scene element parsing, effectively addressing complex and diverse urban spatial features.
format Article
id doaj-art-dfde1e4d645f455ca1e867cec93f5d34
institution Kabale University
issn 2730-6852
language English
publishDate 2025-04-01
publisher Springer
record_format Article
series Computational Urban Science
spelling doaj-art-dfde1e4d645f455ca1e867cec93f5d342025-08-20T03:52:24ZengSpringerComputational Urban Science2730-68522025-04-015111410.1007/s43762-025-00181-1Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAMJing Xu0Department of Architecture Engineering, City University of ZhengzhouAbstract As urbanization continues to accelerate, there is a growing need for the analysis of urban spatial perception attributes and scene elements. In response, the research proposed a multi-scale perception network-based model for extracting urban scene elements and an attention-enhanced segmentation network-based model for analyzing urban spatial scene structures. The urban scene feature extraction model incorporated Siamese convolutional neural networks and convolutional block attention to achieve multi-scale perception extraction. The urban spatial scene structure analysis model combined a dynamic attention module with an encoder-decoder architecture to enhance the accuracy of scene element segmentation. During testing of the urban scene feature extraction model at a resolution of 768, its classification accuracy and cross entropy were 95.4% and 0.065, respectively. The model's average ranking accuracy for beauty, comfort, and cleanliness was 92.5%, 91.8%, and 93.2%. In testing the urban spatial scene structure analysis model, the boundary intersection to union ratio and boundary F1 score were 81.2% and 82.1%, respectively, with a boundary complexity of 0.6. The results demonstrated that the proposed method excelled in tasks such as perceptual attribute classification and scene element parsing, effectively addressing complex and diverse urban spatial features.https://doi.org/10.1007/s43762-025-00181-1VGG-16CBAMCitySpatial scenePerceived attributesElement extraction
spellingShingle Jing Xu
Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM
Computational Urban Science
VGG-16
CBAM
City
Spatial scene
Perceived attributes
Element extraction
title Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM
title_full Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM
title_fullStr Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM
title_full_unstemmed Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM
title_short Extracting urban spatial perception attributes and scene elements by integrating VGG-16 and CBAM
title_sort extracting urban spatial perception attributes and scene elements by integrating vgg 16 and cbam
topic VGG-16
CBAM
City
Spatial scene
Perceived attributes
Element extraction
url https://doi.org/10.1007/s43762-025-00181-1
work_keys_str_mv AT jingxu extractingurbanspatialperceptionattributesandsceneelementsbyintegratingvgg16andcbam