Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception

The emergence of Multimodal Large Language Models (MLLMs) has made methods of artificial intelligence accessible to the general public in a conversational way. It offers tools for the automated visual assessment of the quality of a built environment for professionals of urban planning without requir...

Full description

Saved in:
Bibliographic Details
Main Author: Rachid Belaroussi
Format: Article
Language:English
Published: MDPI AG 2025-04-01
Series:Big Data and Cognitive Computing
Subjects:
Online Access:https://www.mdpi.com/2504-2289/9/4/100
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850144588410388480
author Rachid Belaroussi
author_facet Rachid Belaroussi
author_sort Rachid Belaroussi
collection DOAJ
description The emergence of Multimodal Large Language Models (MLLMs) has made methods of artificial intelligence accessible to the general public in a conversational way. It offers tools for the automated visual assessment of the quality of a built environment for professionals of urban planning without requiring specific technical knowledge on computing. We investigated the capability of MLLMs to perceive urban environments based on images and textual prompts. We compared the outputs of several popular models—ChatGPT, Gemini and Grok—to the visual assessment of experts in Architecture, Engineering and Construction (AEC) in the context of a real estate construction project. Our analysis was based on subjective attributes proposed to characterize various aspects of a built environment. Four urban identities served as case studies, set in a virtual environment designed using professional 3D models. We found that there can be an alignment between human and AI evaluation on some aspects such as space and scale and architectural style, and more general accordance in environments with vegetation. However, there were noticeable differences in response patterns between the AIs and AEC experts, particularly concerning subjective aspects such as the general emotional resonance of specific urban identities. It raises questions regarding the hallucinations of generative AI where the AI invents information and behaves creatively but its outputs are not accurate.
format Article
id doaj-art-aef548c7dba2453788c94372aa2585da
institution OA Journals
issn 2504-2289
language English
publishDate 2025-04-01
publisher MDPI AG
record_format Article
series Big Data and Cognitive Computing
spelling doaj-art-aef548c7dba2453788c94372aa2585da2025-08-20T02:28:19ZengMDPI AGBig Data and Cognitive Computing2504-22892025-04-019410010.3390/bdcc9040100Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert PerceptionRachid Belaroussi0COSYS-GRETTIA, University Gustave Eiffel, F-77447 Marne-la-Vallée, FranceThe emergence of Multimodal Large Language Models (MLLMs) has made methods of artificial intelligence accessible to the general public in a conversational way. It offers tools for the automated visual assessment of the quality of a built environment for professionals of urban planning without requiring specific technical knowledge on computing. We investigated the capability of MLLMs to perceive urban environments based on images and textual prompts. We compared the outputs of several popular models—ChatGPT, Gemini and Grok—to the visual assessment of experts in Architecture, Engineering and Construction (AEC) in the context of a real estate construction project. Our analysis was based on subjective attributes proposed to characterize various aspects of a built environment. Four urban identities served as case studies, set in a virtual environment designed using professional 3D models. We found that there can be an alignment between human and AI evaluation on some aspects such as space and scale and architectural style, and more general accordance in environments with vegetation. However, there were noticeable differences in response patterns between the AIs and AEC experts, particularly concerning subjective aspects such as the general emotional resonance of specific urban identities. It raises questions regarding the hallucinations of generative AI where the AI invents information and behaves creatively but its outputs are not accurate.https://www.mdpi.com/2504-2289/9/4/100ChatGPTGeminiGrokbuilt environmentarchitecture
spellingShingle Rachid Belaroussi
Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
Big Data and Cognitive Computing
ChatGPT
Gemini
Grok
built environment
architecture
title Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_full Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_fullStr Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_full_unstemmed Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_short Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_sort subjective assessment of a built environment by chatgpt gemini and grok comparison with architecture engineering and construction expert perception
topic ChatGPT
Gemini
Grok
built environment
architecture
url https://www.mdpi.com/2504-2289/9/4/100
work_keys_str_mv AT rachidbelaroussi subjectiveassessmentofabuiltenvironmentbychatgptgeminiandgrokcomparisonwitharchitectureengineeringandconstructionexpertperception