Assessing how accurately large language models encode and apply the common European framework of reference for languages

Large Language Models (LLMs) can have a transformative effect on a variety of domains, including education, and it is therefore pressing to understand whether these models have knowledge of – or, in other words, how they have encoded – the specific pedagogical requirements of different educational d...

Full description

Saved in:

Bibliographic Details
Main Authors:	Luca Benedetto, Gabrielle Gaudeau, Andrew Caines, Paula Buttery
Format:	Article
Language:	English
Published:	Elsevier 2025-06-01
Series:	Computers and Education: Artificial Intelligence
Subjects:	Large language models Language learning Common European Framework of Reference for Languages
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666920X24001565
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841545524561313792
author	Luca Benedetto Gabrielle Gaudeau Andrew Caines Paula Buttery
author_facet	Luca Benedetto Gabrielle Gaudeau Andrew Caines Paula Buttery
author_sort	Luca Benedetto
collection	DOAJ
description	Large Language Models (LLMs) can have a transformative effect on a variety of domains, including education, and it is therefore pressing to understand whether these models have knowledge of – or, in other words, how they have encoded – the specific pedagogical requirements of different educational domains, and whether they use this when performing educational tasks. In this work, we propose an approach to evaluate the knowledge – or encoding – that the LLMs have of the Common European Framework of Reference for Languages (CEFR), and use it to evaluate five modern LLMs. Our study shows that the suite of tasks we propose is quite challenging for all the LLMs, and they often provide results which are not satisfactory and would be unusable in educational applications, suggesting that – even if they encode some information about the CEFR – this knowledge is not really leveraged when performing downstream tasks.
format	Article
id	doaj-art-9be6d70d787f4da39bddfb24cecc070a
institution	Kabale University
issn	2666-920X
language	English
publishDate	2025-06-01
publisher	Elsevier
record_format	Article
series	Computers and Education: Artificial Intelligence
spelling	doaj-art-9be6d70d787f4da39bddfb24cecc070a2025-01-12T05:26:11ZengElsevierComputers and Education: Artificial Intelligence2666-920X2025-06-018100353Assessing how accurately large language models encode and apply the common European framework of reference for languagesLuca Benedetto0Gabrielle Gaudeau1Andrew Caines2Paula Buttery3Corresponding author.; ALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomALTA Institute, Department of Computer Science & Technology, University of Cambridge, Cambridge, United KingdomLarge Language Models (LLMs) can have a transformative effect on a variety of domains, including education, and it is therefore pressing to understand whether these models have knowledge of – or, in other words, how they have encoded – the specific pedagogical requirements of different educational domains, and whether they use this when performing educational tasks. In this work, we propose an approach to evaluate the knowledge – or encoding – that the LLMs have of the Common European Framework of Reference for Languages (CEFR), and use it to evaluate five modern LLMs. Our study shows that the suite of tasks we propose is quite challenging for all the LLMs, and they often provide results which are not satisfactory and would be unusable in educational applications, suggesting that – even if they encode some information about the CEFR – this knowledge is not really leveraged when performing downstream tasks.http://www.sciencedirect.com/science/article/pii/S2666920X24001565Large language modelsLanguage learningCommon European Framework of Reference for Languages
spellingShingle	Luca Benedetto Gabrielle Gaudeau Andrew Caines Paula Buttery Assessing how accurately large language models encode and apply the common European framework of reference for languages Computers and Education: Artificial Intelligence Large language models Language learning Common European Framework of Reference for Languages
title	Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_full	Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_fullStr	Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_full_unstemmed	Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_short	Assessing how accurately large language models encode and apply the common European framework of reference for languages
title_sort	assessing how accurately large language models encode and apply the common european framework of reference for languages
topic	Large language models Language learning Common European Framework of Reference for Languages
url	http://www.sciencedirect.com/science/article/pii/S2666920X24001565
work_keys_str_mv	AT lucabenedetto assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages AT gabriellegaudeau assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages AT andrewcaines assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages AT paulabuttery assessinghowaccuratelylargelanguagemodelsencodeandapplythecommoneuropeanframeworkofreferenceforlanguages

Assessing how accurately large language models encode and apply the common European framework of reference for languages

Similar Items