Assessing how accurately large language models encode and apply the common European framework of reference for languages

Large Language Models (LLMs) can have a transformative effect on a variety of domains, including education, and it is therefore pressing to understand whether these models have knowledge of – or, in other words, how they have encoded – the specific pedagogical requirements of different educational d...

Full description

Saved in:
Bibliographic Details
Main Authors: Luca Benedetto, Gabrielle Gaudeau, Andrew Caines, Paula Buttery
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Computers and Education: Artificial Intelligence
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666920X24001565
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841545524561313792
author Luca Benedetto
Gabrielle Gaudeau
Andrew Caines
Paula Buttery
author_facet Luca Benedetto
Gabrielle Gaudeau
Andrew Caines
Paula Buttery
author_sort Luca Benedetto
collection DOAJ
description Large Language Models (LLMs) can have a transformative effect on a variety of domains, including education, and it is therefore pressing to understand whether these models have knowledge of – or, in other words, how they have encoded – the specific pedagogical requirements of different educational domains, and whether they use this when performing educational tasks. In this work, we propose an approach to evaluate the knowledge – or encoding – that the LLMs have of the Common European Framework of Reference for Languages (CEFR), and use it to evaluate five modern LLMs. Our study shows that the suite of tasks we propose is quite challenging for all the LLMs, and they often provide results which are not satisfactory and would be unusable in educational applications, suggesting that – even if they encode some information about the CEFR – this knowledge is not really leveraged when performing downstream tasks.
format Article
id doaj-art-9be6d70d787f4da39bddfb24cecc070a
institution Kabale University
issn 2666-920X
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series Computers and Education: Artificial Intelligence
spelling doaj-art-9be6d70d787f4da39bddfb24cecc070a2025-01-12T05:26:11ZengElsevierComputers and Education: Artificial Intelligence2666-920X2025-06-018100353Assessing how accurately large language models encode and apply the common European framework of reference for languagesLuca Benedetto0Gabrielle Gaudeau1Andrew Caines2Paula Buttery3Corresponding author.; ALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomLarge Language Models (LLMs) can have a transformative effect on a variety of domains, including education, and it is therefore pressing to understand whether these models have knowledge of – or, in other words, how they have encoded – the specific pedagogical requirements of different educational domains, and whether they use this when performing educational tasks. In this work, we propose an approach to evaluate the knowledge – or encoding – that the LLMs have of the Common European Framework of Reference for Languages (CEFR), and use it to evaluate five modern LLMs. Our study shows that the suite of tasks we propose is quite challenging for all the LLMs, and they often provide results which are not satisfactory and would be unusable in educational applications, suggesting that – even if they encode some information about the CEFR – this knowledge is not really leveraged when performing downstream tasks.http://www.sciencedirect.com/science/article/pii/S2666920X24001565Large language modelsLanguage learningCommon European Framework of Reference for Languages
spellingShingle Luca Benedetto
Gabrielle Gaudeau
Andrew Caines
Paula Buttery
Assessing how accurately large language models encode and apply the common European framework of reference for languages
Computers and Education: Artificial Intelligence
Large language models
Language learning
Common European Framework of Reference for Languages
title Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_full Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_fullStr Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_full_unstemmed Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_short Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_sort assessing how accurately large language models encode and apply the common european framework of reference for languages
topic Large language models
Language learning
Common European Framework of Reference for Languages
url http://www.sciencedirect.com/science/article/pii/S2666920X24001565
work_keys_str_mv AT lucabenedetto assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages
AT gabriellegaudeau assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages
AT andrewcaines assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages
AT paulabuttery assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages