Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception

The emergence of Multimodal Large Language Models (MLLMs) has made methods of artificial intelligence accessible to the general public in a conversational way. It offers tools for the automated visual assessment of the quality of a built environment for professionals of urban planning without requir...

Full description

Saved in:

Bibliographic Details
Main Author:	Rachid Belaroussi
Format:	Article
Language:	English
Published:	MDPI AG 2025-04-01
Series:	Big Data and Cognitive Computing
Subjects:	ChatGPT Gemini Grok built environment architecture
Online Access:	https://www.mdpi.com/2504-2289/9/4/100
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850144588410388480
author	Rachid Belaroussi
author_facet	Rachid Belaroussi
author_sort	Rachid Belaroussi
collection	DOAJ
description	The emergence of Multimodal Large Language Models (MLLMs) has made methods of artificial intelligence accessible to the general public in a conversational way. It offers tools for the automated visual assessment of the quality of a built environment for professionals of urban planning without requiring specific technical knowledge on computing. We investigated the capability of MLLMs to perceive urban environments based on images and textual prompts. We compared the outputs of several popular models—ChatGPT, Gemini and Grok—to the visual assessment of experts in Architecture, Engineering and Construction (AEC) in the context of a real estate construction project. Our analysis was based on subjective attributes proposed to characterize various aspects of a built environment. Four urban identities served as case studies, set in a virtual environment designed using professional 3D models. We found that there can be an alignment between human and AI evaluation on some aspects such as space and scale and architectural style, and more general accordance in environments with vegetation. However, there were noticeable differences in response patterns between the AIs and AEC experts, particularly concerning subjective aspects such as the general emotional resonance of specific urban identities. It raises questions regarding the hallucinations of generative AI where the AI invents information and behaves creatively but its outputs are not accurate.
format	Article
id	doaj-art-aef548c7dba2453788c94372aa2585da
institution	OA Journals
issn	2504-2289
language	English
publishDate	2025-04-01
publisher	MDPI AG
record_format	Article
series	Big Data and Cognitive Computing
spelling	doaj-art-aef548c7dba2453788c94372aa2585da2025-08-20T02:28:19ZengMDPI AGBig Data and Cognitive Computing2504-22892025-04-019410010.3390/bdcc9040100Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert PerceptionRachid Belaroussi0COSYS-GRETTIA, University Gustave Eiffel, F-77447 Marne-la-Vallée, FranceThe emergence of Multimodal Large Language Models (MLLMs) has made methods of artificial intelligence accessible to the general public in a conversational way. It offers tools for the automated visual assessment of the quality of a built environment for professionals of urban planning without requiring specific technical knowledge on computing. We investigated the capability of MLLMs to perceive urban environments based on images and textual prompts. We compared the outputs of several popular models—ChatGPT, Gemini and Grok—to the visual assessment of experts in Architecture, Engineering and Construction (AEC) in the context of a real estate construction project. Our analysis was based on subjective attributes proposed to characterize various aspects of a built environment. Four urban identities served as case studies, set in a virtual environment designed using professional 3D models. We found that there can be an alignment between human and AI evaluation on some aspects such as space and scale and architectural style, and more general accordance in environments with vegetation. However, there were noticeable differences in response patterns between the AIs and AEC experts, particularly concerning subjective aspects such as the general emotional resonance of specific urban identities. It raises questions regarding the hallucinations of generative AI where the AI invents information and behaves creatively but its outputs are not accurate.https://www.mdpi.com/2504-2289/9/4/100ChatGPTGeminiGrokbuilt environmentarchitecture
spellingShingle	Rachid Belaroussi Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception Big Data and Cognitive Computing ChatGPT Gemini Grok built environment architecture
title	Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_full	Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_fullStr	Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_full_unstemmed	Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_short	Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception
title_sort	subjective assessment of a built environment by chatgpt gemini and grok comparison with architecture engineering and construction expert perception
topic	ChatGPT Gemini Grok built environment architecture
url	https://www.mdpi.com/2504-2289/9/4/100
work_keys_str_mv	AT rachidbelaroussi subjectiveassessmentofabuiltenvironmentbychatgptgeminiandgrokcomparisonwitharchitectureengineeringandconstructionexpertperception

Subjective Assessment of a Built Environment by ChatGPT, Gemini and Grok: Comparison with Architecture, Engineering and Construction Expert Perception

Similar Items