A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans

In recent years, advancements in the interaction and collaboration between humans and have garnered significant attention. Social intelligence plays a crucial role in facilitating natural interactions and seamless communication between humans and Artificial Intelligence (AI). To assess AI’s ability...

Full description

Saved in:

Bibliographic Details
Main Authors:	Erika Mori, Yue Qiu, Hirokatsu Kataoka, Yoshimitsu Aoki
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Series:	Sensors
Subjects:	human–robot interaction social intelligence understanding human behavior emotion recognition large language models (LLMs) VideoQA
Online Access:	https://www.mdpi.com/1424-8220/25/2/477
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832587494488539136
author	Erika Mori Yue Qiu Hirokatsu Kataoka Yoshimitsu Aoki
author_facet	Erika Mori Yue Qiu Hirokatsu Kataoka Yoshimitsu Aoki
author_sort	Erika Mori
collection	DOAJ
description	In recent years, advancements in the interaction and collaboration between humans and have garnered significant attention. Social intelligence plays a crucial role in facilitating natural interactions and seamless communication between humans and Artificial Intelligence (AI). To assess AI’s ability to understand human interactions and the components necessary for such comprehension, datasets like Social-IQ have been developed. However, these datasets often rely on a simplistic question-and-answer format and lack justifications for the provided answers. Furthermore, existing methods typically produce direct answers by selecting from predefined choices without generating intermediate outputs, which hampers interpretability and reliability. To address these limitations, we conducted a comprehensive evaluation of AI methods on a video-based Question Answering (QA) benchmark focused on human interactions, leveraging additional annotations related to human responses. Our analysis highlights significant differences between human and AI response patterns and underscores critical shortcomings in current benchmarks. We anticipate that these findings will guide the creation of more advanced datasets and represent an important step toward achieving natural communication between humans and AI.
format	Article
id	doaj-art-531453e2866d482ba39bf7b46725e921
institution	Kabale University
issn	1424-8220
language	English
publishDate	2025-01-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj-art-531453e2866d482ba39bf7b46725e9212025-01-24T13:49:04ZengMDPI AGSensors1424-82202025-01-0125247710.3390/s25020477A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and HumansErika Mori0Yue Qiu1Hirokatsu Kataoka2Yoshimitsu Aoki3National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba 305-8560, JapanNational Institute of Advanced Industrial Science and Technology (AIST), Tsukuba 305-8560, JapanNational Institute of Advanced Industrial Science and Technology (AIST), Tsukuba 305-8560, JapanDepartment of Electronics and Electrical Engineering, Faculty of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, JapanIn recent years, advancements in the interaction and collaboration between humans and have garnered significant attention. Social intelligence plays a crucial role in facilitating natural interactions and seamless communication between humans and Artificial Intelligence (AI). To assess AI’s ability to understand human interactions and the components necessary for such comprehension, datasets like Social-IQ have been developed. However, these datasets often rely on a simplistic question-and-answer format and lack justifications for the provided answers. Furthermore, existing methods typically produce direct answers by selecting from predefined choices without generating intermediate outputs, which hampers interpretability and reliability. To address these limitations, we conducted a comprehensive evaluation of AI methods on a video-based Question Answering (QA) benchmark focused on human interactions, leveraging additional annotations related to human responses. Our analysis highlights significant differences between human and AI response patterns and underscores critical shortcomings in current benchmarks. We anticipate that these findings will guide the creation of more advanced datasets and represent an important step toward achieving natural communication between humans and AI.https://www.mdpi.com/1424-8220/25/2/477human–robot interactionsocial intelligenceunderstanding human behavioremotion recognitionlarge language models (LLMs)VideoQA
spellingShingle	Erika Mori Yue Qiu Hirokatsu Kataoka Yoshimitsu Aoki A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans Sensors human–robot interaction social intelligence understanding human behavior emotion recognition large language models (LLMs) VideoQA
title	A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans
title_full	A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans
title_fullStr	A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans
title_full_unstemmed	A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans
title_short	A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans
title_sort	comprehensive analysis of a social intelligence dataset and response tendencies between large language models llms and humans
topic	human–robot interaction social intelligence understanding human behavior emotion recognition large language models (LLMs) VideoQA
url	https://www.mdpi.com/1424-8220/25/2/477
work_keys_str_mv	AT erikamori acomprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT yueqiu acomprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT hirokatsukataoka acomprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT yoshimitsuaoki acomprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT erikamori comprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT yueqiu comprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT hirokatsukataoka comprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans AT yoshimitsuaoki comprehensiveanalysisofasocialintelligencedatasetandresponsetendenciesbetweenlargelanguagemodelsllmsandhumans

A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans

Similar Items