Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers

Reading is an essential method for adults to learn new languages, but difficulty reading texts in a foreign language can increase learners’ anxiety. Identifying text difficulty from the reader’s perspective can aid language learning by tailoring texts to readers’ nee...

Full description

Saved in:
Bibliographic Details
Main Authors: Javier Melo, Leigh Fernandez, Shoya Ishimaru
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10858735/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823859637375467520
author Javier Melo
Leigh Fernandez
Shoya Ishimaru
author_facet Javier Melo
Leigh Fernandez
Shoya Ishimaru
author_sort Javier Melo
collection DOAJ
description Reading is an essential method for adults to learn new languages, but difficulty reading texts in a foreign language can increase learners&#x2019; anxiety. Identifying text difficulty from the reader&#x2019;s perspective can aid language learning by tailoring texts to readers&#x2019; needs. There is little research focusing on L2 speakers or using a multimodal approach, i.e., using multiple sensors, to detect subjective difficulty. In this study (<inline-formula> <tex-math notation="LaTeX">$N=30$ </tex-math></inline-formula>) we determined L2 speakers&#x2019; subjective difficulty while reading using language proficiency and objective text difficulty, combined with physiological data. We compared machine learning classifiers combining eye, skin and heart sensor data against models using each modality separately. Additionally, we assessed the effect on model performance of shifting the data to account for delayed physiological responses. The models detected 3 levels of subjective difficulty (low, medium, high) and were evaluated using leave-one-participant-out (LoPo) and leave-one-document-out (LoDo) cross-validation. The results showed acceptable levels of generalization to new participants (<inline-formula> <tex-math notation="LaTeX">$Acc_{LoPo} = 0.434$ </tex-math></inline-formula>) and documents (<inline-formula> <tex-math notation="LaTeX">$Acc_{LoDo} = 0.521$ </tex-math></inline-formula>). Combining sensor data from all modalities improved predictions in both LoDo and LoPo cross-validation, compared to each modality in isolation. Shifting the data to account for physiological response delay did not improve model performance compared to not shifting the data. These findings support refining subjective difficulty detection models and their implementation in adaptive language learning systems. Finally, this work contributes to the field of cognitive science and technology by laying the foundation for innovative approaches to cognitive state detection.
format Article
id doaj-art-b8cb8d312a474348afa8c19fd4522446
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-b8cb8d312a474348afa8c19fd45224462025-02-11T00:01:32ZengIEEEIEEE Access2169-35362025-01-0113245552457510.1109/ACCESS.2025.353715610858735Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English SpeakersJavier Melo0https://orcid.org/0000-0003-2235-5243Leigh Fernandez1https://orcid.org/0000-0002-1635-2671Shoya Ishimaru2https://orcid.org/0000-0002-5374-1510German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, GermanyPsycholinguistics and Language Development, University of Kaiserslautern-Landau, Kaiserslautern, GermanyGraduate School of Informatics, Osaka Metropolitan University, Sakai, JapanReading is an essential method for adults to learn new languages, but difficulty reading texts in a foreign language can increase learners&#x2019; anxiety. Identifying text difficulty from the reader&#x2019;s perspective can aid language learning by tailoring texts to readers&#x2019; needs. There is little research focusing on L2 speakers or using a multimodal approach, i.e., using multiple sensors, to detect subjective difficulty. In this study (<inline-formula> <tex-math notation="LaTeX">$N=30$ </tex-math></inline-formula>) we determined L2 speakers&#x2019; subjective difficulty while reading using language proficiency and objective text difficulty, combined with physiological data. We compared machine learning classifiers combining eye, skin and heart sensor data against models using each modality separately. Additionally, we assessed the effect on model performance of shifting the data to account for delayed physiological responses. The models detected 3 levels of subjective difficulty (low, medium, high) and were evaluated using leave-one-participant-out (LoPo) and leave-one-document-out (LoDo) cross-validation. The results showed acceptable levels of generalization to new participants (<inline-formula> <tex-math notation="LaTeX">$Acc_{LoPo} = 0.434$ </tex-math></inline-formula>) and documents (<inline-formula> <tex-math notation="LaTeX">$Acc_{LoDo} = 0.521$ </tex-math></inline-formula>). Combining sensor data from all modalities improved predictions in both LoDo and LoPo cross-validation, compared to each modality in isolation. Shifting the data to account for physiological response delay did not improve model performance compared to not shifting the data. These findings support refining subjective difficulty detection models and their implementation in adaptive language learning systems. Finally, this work contributes to the field of cognitive science and technology by laying the foundation for innovative approaches to cognitive state detection.https://ieeexplore.ieee.org/document/10858735/Cognitive loadelectrodermal activityeye-trackinghuman-computer interactionL2 English speakers
spellingShingle Javier Melo
Leigh Fernandez
Shoya Ishimaru
Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers
IEEE Access
Cognitive load
electrodermal activity
eye-tracking
human-computer interaction
L2 English speakers
title Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers
title_full Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers
title_fullStr Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers
title_full_unstemmed Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers
title_short Automatic Classification of Difficulty of Texts From Eye Gaze and Physiological Measures of L2 English Speakers
title_sort automatic classification of difficulty of texts from eye gaze and physiological measures of l2 english speakers
topic Cognitive load
electrodermal activity
eye-tracking
human-computer interaction
L2 English speakers
url https://ieeexplore.ieee.org/document/10858735/
work_keys_str_mv AT javiermelo automaticclassificationofdifficultyoftextsfromeyegazeandphysiologicalmeasuresofl2englishspeakers
AT leighfernandez automaticclassificationofdifficultyoftextsfromeyegazeandphysiologicalmeasuresofl2englishspeakers
AT shoyaishimaru automaticclassificationofdifficultyoftextsfromeyegazeandphysiologicalmeasuresofl2englishspeakers