A benchmark for evaluating crisis information generation capabilities in LLMs
Introduction. Large language models (LLMs) have become increasingly significant in crisis information management due to their advanced natural language processing capabilities. This study aims to develop a comprehensive evaluation benchmark to assess the effectiveness of LLMs in generating crisis i...
Saved in:
| Main Authors: | Ruilian Han, Lu An, Wei Zhou, Gang Li |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | University of Borås, 2025-03-01 |
| Series: | Information Research: An International Electronic Journal |
| Online Access: | https://publicera.kb.se/ir/article/view/47518 |
Similar Items
- RVBench: Role values benchmark for role-playing LLMs
  by: Ye Wang, et al.
  Published: (2025-08-01)
- Systematic Analysis of Retrieval-Augmented Generation-Based LLMs for Medical Chatbot Applications
  by: Arunabh Bora, et al.
  Published: (2024-10-01)
- TSTBench: A Comprehensive Benchmark for Text Style Transfer
  by: Yifei Xie, et al.
  Published: (2025-05-01)
- LLMs in the Generation of Seismic Alert Communiqués
  by: Oscar Peña-Cáceres, et al.
  Published: (2025-05-01)
- Measuring and Improving the Efficiency of Python Code Generated by LLMs Using CoT Prompting and Fine-Tuning
  by: Ramya Jonnala, et al.
  Published: (2025-01-01)