MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data

Abstract: Large language models (LLMs) have significantly advanced natural language understanding and demonstrated strong problem-solving abilities. Despite these successes, most LLMs still struggle with solving mathematical problems due to the intricate reasoning required. To support rigorous evalua...

Bibliographic Details
Main Authors:	Meng Fang, Xiangpeng Wan, Fei Lu, Fei Xing, Kai Zou
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-08-01
Series:	Scientific Data
Online Access:	https://doi.org/10.1038/s41597-025-05283-3

Similar Items

Crafting math minds: A bibliometric odyssey into innovative didactical designs for learning (2006-2023)
by: Dadan Dasari, et al.
Published: (2024-02-01)

Localization of the Odyssey’s Underworld
by: Jonathan S. Burgess
Published: (2016-04-01)

American Odyssey : the United States in the twentieth century /
by: Nash, Gary B.
Published: (1999)

GREEKS AND BARBARIANS IN HOMER’S “ODYSSEY”
by: Ştefania VOICU
Published: (2013-05-01)

The Odyssey Handbook and Guide to Writing /
by: Woods, George Benjamin
Published: (1954)

Enhancing Early Mathematical Skills Through Math Games
by: İhsan Seyit Ertem, et al.
Published: (2024-08-01)

My Odyssey through Medical Education
by: Ashok Kumar Khurana
Published: (2023-04-01)

Population genetic and phylogeographic insights into the phyllosomal odyssey
by: Matthew Iacchei
Published: (2014-03-01)

The Odyssey of Homework During the COVID-19 Pandemic
by: Eugen Bruno Ștefan
Published: (2021-08-01)

Rockin’ the Subsurface: Learning Geophysics with ‘Electromagnetic Odyssey’
by: Katya Alvarez-Molina, et al.
Published: (2024-12-01)

From Barbers to Bots: A Surgeon’s Odyssey
by: Surjeet Dwivedi, et al.
Published: (2025-05-01)

Mathematics on the blackboard! Emotional processing of math-related pictures in individuals with math anxiety
by: Rocío Linares, et al.
Published: (2025-07-01)

Response time on math skills and its association with math skills accuracy and physics course grades
by: Harish Moni Prakash, et al.
Published: (2025-04-01)

The cost of the diagnostic odyssey of patients with suspected rare diseases
by: Rick Glaubitz, et al.
Published: (2025-05-01)

Caregivers’ experiences and challenges of the diagnostic odyssey in Dravet syndrome
by: Jan Domaradzki, et al.
Published: (2025-05-01)

Odyssey of environmental and microbial interventions in maize crop improvement
by: Alok Kumar Singh, et al.
Published: (2025-01-01)

Creative industries & cultural science: A definitional odyssey
by: Potts Jason
Published: (2008-01-01)

Exploring Cryoglobulinemia's Clinical Odyssey: A Case Series
by: Shivangini Duggal, et al.
Published: (2025-04-01)

Unveiling the radiological odyssey: Navigating the interstitial with artificial intelligence
by: Anna Russo, et al.
Published: (2024-11-01)

The relationship between grade nine math national exam results, prior skills, and an interest in math
by: Triinu Kilp-Kabel, et al.
Published: (2025-04-01)

Undiagnosed Hackathons: Ending the diagnostic odyssey for individuals with rare disease
by: Helene Cederroth, et al.
Published: (2025-01-01)

The Odyssey of D.H. Lawrence: Modernism, Europe and the New World
by: Peter Marks
Published: (1999-12-01)

Maths Anxiety: The Fear Factor in the Mathematics Classroom
by: Julie Whyte, et al.
Published: (2012-04-01)

Engineering students' perceptions and actual use of AI-based math tools for solving mathematical problems
by: Kimberly F. Garcia, et al.
Published: (2025-06-01)

Decoding colorectal cancer lung metastasis: a global research odyssey
by: Xu Zhang, et al.
Published: (2025-07-01)

Technological Determinism and Singularity in Clarke & Kubrick’s 2001: A Space Odyssey
by: Cenk Tan
Published: (2025-06-01)

Reception of Ancient Plot of Odyssey in Stefan Schütz’s Play “Odysseus' Heimkehr”
by: A. S. Frolova
Published: (2025-03-01)

China’s Afghan Odyssey: From War to Prosperity in Taliban-Controlled Afghanistan
by: Amit Kumar, et al.
Published: (2024-01-01)

The long odyssey for the DEE‐CDKL5 diagnosis: A call for action
by: Kette D. Valente, et al.
Published: (2024-12-01)

A Taxonomic Odyssey: An annotated checklist of Peromyscus (Cricetidae, Rodentia) in Honduras
by: Celeste M. López, et al.
Published: (2024-11-01)

The odyssey of a judicial career in precarious times : my trials and triumphs as a three-time Chief Justice of Uganda /
by: Wambuzi, S. W. W.
Published: (2014)

Foundation Maths /
by: Croft, Anthony
Published: (2010)

Math Connections : a secondary mathematics core curriculum /
by: Berlinghoff, William P.
Published: (2006)

Enhancing Students’ Performance in Math 9 Through Math-Collab
by: Jairoh N. Taracina
Published: (2024-06-01)

A Comparison: The Science of Mathematics vs. The Science of Math
by: Georgia Southern University
Published: (2025-01-01)

Study on solving math word problem based on contrastive learning
by: ZHANG Tiancheng, et al.
Published: (2025-01-01)

The neurocognitive mechanism underlying math avoidance among math anxious people
by: Jie Liu, et al.
Published: (2025-08-01)

Understanding the Role of Cognitive Abilities and Math Anxiety in Adolescent Math Achievement
by: Lorenzo Esposito, et al.
Published: (2025-04-01)

Thus Spoke Ahuramazda: 2004: A Systemcide -on To –the-l.m.d. Odyssey
by: Nacif Labed
Published: (2024-10-01)