Evaluating the Intelligence of large language models: A comparative study using verbal and visual IQ tests

Evaluating the Intelligence of large language models: A comparative study using verbal and visual IQ tests

Large language models (LLMs) excel on many specialized benchmarks, yet their general-reasoning ability remains opaque. We therefore test 18 models – including GPT-4, Claude 3 and Gemini Pro – on a 14-section IQ suite spanning verbal, numerical and visual puzzles and add a “multi-agent reflection” va...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sherif Abdelkarim, David Lu, Dora-Luz Flores, Susanne Jaeggi, Pierre Baldi
Format:	Article
Language:	English
Published:	Elsevier 2025-08-01
Series:	Computers in Human Behavior: Artificial Humans
Subjects:	Large language models Intelligence Quotient Artificial Intelligence
Online Access:	http://www.sciencedirect.com/science/article/pii/S2949882125000544
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Carbonated drinks, chips intake and their relation to Intelligence Quotient (IQ) among primary school children in Baghdad city, Iraq
by: Hasanain Faisal Ghazi, et al.
Published: (2012-11-01)

Brain Size or Brain Organization and Intelligence
by: J Gordon Millichap
Published: (1998-06-01)

Emotional Intelligence: What Is It and How Can It Transform Your Life?
by: Harpreet Singh Dhillon, et al.
Published: (2021-07-01)

Concurrent validity of intelligence assessments in children with developmental disabilities in an Asian setting: Comparison of the Kaufman brief intelligence test – Second edition with the Wechsler Intelligence Scales
by: Alison S.M. Cheng, et al.
Published: (2024-07-01)

Pengaruh Keterampilan Menyimak dan Intelligence Quotient terhadap Prestasi Belajar Siswa
by: Pien Supinah
Published: (2003-06-01)

Depression severity and verbal comprehension in children and adolescents with a major depressive episode
by: Monia Trasolini, et al.
Published: (2024-09-01)

Risk factors for Ascaris lumbricoides infection and its association with nutritional status and IQ in 14-Year old adolescents in Chitwan, Nepal
by: Rajendra Prasad Parajuli, et al.
Published: (2024-10-01)

Are Large Language Models Intelligent? Are Humans?
by: Olle Häggström
Published: (2023-08-01)

The associations between gut microbiota and fecal metabolites with intelligence quotient in preschoolers
by: Jinghua Long, et al.
Published: (2024-10-01)

Cognitive heterogeneity in major depressive disorder: classification by IQ trajectory and multimodal neuroimaging profiles
by: Xiao Yang, et al.
Published: (2025-08-01)

Autism, intelligence, language, and adaptive behavior, disentangling a complex relationship
by: Chiara Failla, et al.
Published: (2024-11-01)

A PLS-SEM Analysis of Emotional Intelligence and Construction Organisation’s Performance Nexus in South Africa
by: Lerato Millicent Aghimien, et al.
Published: (2024-10-01)

Pilot study investigating the relationship between motor skill, intelligence and perceptual reasoning and early academic achievement in children
by: Behrouz Ghorbanzadeh, et al.
Published: (2025-12-01)

ANALISIS POTENSI DIRI MAHASISWA TERHADAP MINAT MENJADI GURU MATEMATIKA
by: Rianto Pali' Datu, et al.
Published: (2022-05-01)

Artificial Intelligence-Based Large Language Models Can Facilitate Patient Education
by: Xochitl Bryson, BA, et al.
Published: (2025-08-01)

Digital chefs and intelligent cooking systems based on multimodal large language model
by: LI Xinyuan, et al.
Published: (2024-12-01)

How Emotional Intelligence and Adversity Quotient Impact Organizational Citizenship Behavior: A Meta-Analysis
by: Sulistiasih Sulistiasih, et al.
Published: (2024-12-01)

The promise and challenges of Artificial Intelligence-Large Language Models (AI-LLMs) in obstetric and gynecology
by: Khanisyah Erza Gumilar, et al.
Published: (2024-07-01)

The importance of Iq, Eq and Sq in forming the personality of a preschool mentor
by: A. Garifullina
Published: (2022-04-01)

General artificial intelligence enables network intelligence: a network operating system based on large language model
by: HUO Ru, et al.
Published: (2025-06-01)

Unlocking innovation and resilience among emergency nurses through cultural intelligence: insights from a structural equation model
by: Nadia Hassan Ali Awad, et al.
Published: (2025-07-01)

Radiology-GPT: A large language model for radiology
by: Zhengliang Liu, et al.
Published: (2025-06-01)

Cockpit-Llama: Driver Intent Prediction in Intelligent Cockpit via Large Language Model
by: Yi Chen, et al.
Published: (2024-12-01)

Research on the development of intelligent computing network for large models
by: GUO Liang, et al.
Published: (2024-06-01)

Performance of the Large Language Models in African rheumatology: a diagnostic test accuracy study of ChatGPT-4, Gemini, Copilot, and Claude artificial intelligence
by: Yannick Laurent Tchenadoyo Bayala, et al.
Published: (2025-05-01)

The Potential Clinical Utility of the Customized Large Language Model in Gastroenterology: A Pilot Study
by: Eun Jeong Gong, et al.
Published: (2024-12-01)

The ‘Implicit Intelligence’ of artificial intelligence. Investigating the potential of large language models in social science research
by: Ottorino Cappelli, et al.
Published: (2024-12-01)

Intelligent accounting question-answering robot based on a large language model and knowledge graph
by: Shi Shengyun, et al.
Published: (2025-04-01)

Opportunities and challenges in the application of large artificial intelligence models in radiology
by: Liangrui Pan, et al.
Published: (2024-06-01)

Educational Roles and Scenarios for Large Language Models: An Ethnographic Research Study of Artificial Intelligence
by: Nikša Alfirević, et al.
Published: (2024-10-01)

LLM-CDM: A Large Language Model Enhanced Cognitive Diagnosis for Intelligent Education
by: Xin Chen, et al.
Published: (2025-01-01)

Large language models meet user interfaces: The case of provisioning feedback
by: Stanislav Pozdniakov, et al.
Published: (2024-12-01)

Individualized prediction of multi-domain intelligence quotient in bipolar disorder patients using resting-state functional connectivity
by: Xiaoyu Li, et al.
Published: (2025-03-01)

Maternal oxidative stress throughout pregnancy and early childhood neurodevelopment at different stages: insights from a prospective cohort study
by: Sen He, et al.
Published: (2025-08-01)

The potential of large language models to advance precision oncology
by: Shufan Liang, et al.
Published: (2025-05-01)

Multi-agent systems powered by large language models: applications in swarm intelligence
by: Cristian Jimenez-Romero, et al.
Published: (2025-05-01)

Humanist educational approach in a world with Large Language Models (Artificial Intelligence): Reflexions and experiences from a university professor
by: Pedro Antonio López de Haro
Published: (2025-06-01)

Large Language Model and Digital Twins Empowered Asynchronous Federated Learning for Secure Data Sharing in Intelligent Labeling
by: Xuanzhu Sheng, et al.
Published: (2024-11-01)

Developing a Machine Intelligence Quotient (MIQ) for evaluating autonomous vehicle intelligence: a conceptual framework
by: Mehdi Cina, et al.
Published: (2024-10-01)

Relationship Between Perinatal and Neonatal Indices and Intelligence Quotient in Very Low Birth Weight Infants at the Age of 6 or 8 Years
by: Shu-Chi Mu, et al.
Published: (2008-04-01)