A dataset for evaluating clinical research claims in large language models
Abstract Large language models (LLMs) have the potential to enhance the verification of health claims. However, issues with hallucination and comprehension of logical statements require these models to be closely scrutinized in healthcare applications. We introduce CliniFact, a scientific claim data...
Saved in:
| Main Authors: | Boya Zhang, Alban Bornet, Anthony Yazdani, Philipp Khlebnikov, Marija Milutinovic, Hossein Rouhizadeh, Poorya Amini, Douglas Teodoro |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-01-01
|
| Series: | Scientific Data |
| Online Access: | https://doi.org/10.1038/s41597-025-04417-x |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
An Evaluation Benchmark for Adverse Drug Event Prediction from Clinical Trial Results
by: Anthony Yazdani, et al.
Published: (2025-03-01) -
A scoping review of artificial intelligence applications in clinical trial risk assessment
by: Douglas Teodoro, et al.
Published: (2025-07-01) -
Erratum to “Detection of Patients at Risk of Multidrug-Resistant Enterobacteriaceae Infection Using Graph Neural Networks: A Retrospective Study”
by: Racha Gouareb, et al.
Published: (2024-01-01) -
Automated fact-checking of climate claims with large language models
by: Markus Leippold, et al.
Published: (2025-02-01) -
The Use of Large Language Models for Marking Datasets in Natural Language Processing
by: Krystina Makoveichuk, et al.
Published: (2023-10-01)