A comparative analysis of encoder-only and decoder-only models for challenging LLM-generated STEM MCQs using a self-evaluation approach
Large Language Models (LLMs) have demonstrated impressive capabilities in various tasks, including Multiple-Choice Question Answering (MCQA) evaluated on benchmark datasets with few-shot prompting. Given the absence of benchmark Science, Technology, Engineering, and Mathematics (STEM) datasets on Mu...
| Main Authors: | Ghada Soliman, Ph.D., Hozaifa Zaki, Mohamed Kilany |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Elsevier, 2025-03-01 |
| Series: | Natural Language Processing Journal |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S294971912500007X |
Similar Items
- Exploration of the Efficiency of SLM-Enabled Platforms for Everyday Tasks
  by: Volodymyr Rusinov, et al.
  Published: (2025-04-01)
- Effects of a long term faculty development program on improvement in quality of MCQs: an impact evaluation study
  by: Rukhsana Ayub, et al.
  Published: (2025-04-01)
- LLM-Based Doppelgänger Models: Leveraging Synthetic Data for Human-Like Responses in Survey Simulations
  by: Suhyun Cho, et al.
  Published: (2024-01-01)
- Comparative analysis of NLP-driven MCQ generators from text sources
  by: Asmae Azzi, et al.
  Published: (2025-12-01)
- Collusion Detection Software in Online Multiple Choice Examinations – A Review
  by: Srinivasa Jayachandra
  Published: (2022-12-01)