Performance Comparison of Large Language Models for Efficient Literature Screening
<b>Background:</b> Systematic reviewers face a growing body of biomedical literature, making early-stage article screening increasingly time-consuming. In this study, we assessed six large language models (LLMs)—OpenHermes, Flan T5, GPT-2, Claude 3 Haiku, GPT-3.5 Turbo, and GPT-4o—for th...
Saved in:
| Main Authors: | Maria Teresa Colangelo, Stefano Guizzardi, Marco Meleti, Elena Calciolari, Carlo Galli |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-05-01
|
| Series: | BioMedInformatics |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2673-7426/5/2/25 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
How to Write Effective Prompts for Screening Biomedical Literature Using Large Language Models
by: Maria Teresa Colangelo, et al.
Published: (2025-03-01) -
Large Language Models in Systematic Review Screening: Opportunities, Challenges, and Methodological Considerations
by: Carlo Galli, et al.
Published: (2025-05-01) -
A Comparative Analysis of Sentence Transformer Models for Automated Journal Recommendation Using PubMed Metadata
by: Maria Teresa Colangelo, et al.
Published: (2025-03-01) -
Validation of automated paper screening for esophagectomy systematic review using large language models
by: Rashi Ramchandani, et al.
Published: (2025-04-01) -
PROMPTHEUS: A Human-Centered Pipeline to Streamline Systematic Literature Reviews with Large Language Models
by: Joao Torres, et al.
Published: (2025-05-01)