Construction of a Multimodal Dataset for Emergency Event Identification and Classification

[Purpose/Significance] Rich Internet data provide a multi-dimensional perspective for understanding emergencies, and multimodal emergency classification methods have emerged. However, the existing multimodal datasets of emergencies are not only scarce, but also lacking in diversity in categories, wh...

Full description

Saved in:
Bibliographic Details
Main Author: Yifan ZHANG, Zuqin CHEN, Jike GE, Mingkun HE, Jie TAN
Format: Article
Language:zho
Published: Editorial Department of Journal of Library and Information Science in Agriculture 2024-10-01
Series:Nongye tushu qingbao xuebao
Subjects:
Online Access:http://nytsqb.aiijournal.com/fileup/1002-1248/PDF/1741772376931-942972400.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849761812440940544
author Yifan ZHANG, Zuqin CHEN, Jike GE, Mingkun HE, Jie TAN
author_facet Yifan ZHANG, Zuqin CHEN, Jike GE, Mingkun HE, Jie TAN
author_sort Yifan ZHANG, Zuqin CHEN, Jike GE, Mingkun HE, Jie TAN
collection DOAJ
description [Purpose/Significance] Rich Internet data provide a multi-dimensional perspective for understanding emergencies, and multimodal emergency classification methods have emerged. However, the existing multimodal datasets of emergencies are not only scarce, but also lacking in diversity in categories, which is not enough to support related research, and greatly affects the progress of subsequent research. Compared with previous public datasets, the dataset constructed in this paper has richer categories and more improved modalities. This dataset solves the key gaps in the availability and diversity of multimodal datasets of emergencies. It not only expands the category range, but also provides more detailed classification in the natural disaster category, which is crucial for developing robust and accurate multimodal classification models. [Method/Process] An emergency event dataset (MEED) based on multimodal information was constructed, which contains data from five categories: accident disasters, public health, social security, natural disasters, and non-emergency events. The natural disaster data are divided into seven subcategories: geological disasters, biological disasters, drought disasters, marine disasters, meteorological disasters, earthquake disasters, and forest and grassland fires. [Results/Conclusions] The existing emergency classification methods were analyzed and validated on the emergency public dataset and MEED. The results showed that MEED helped improve the performance of multimodal models by more than 10% compared with the currently available emergency datasets. The results show that the improvement in model performance highlights the value of MEED in promoting emergency management and response research and applications. The dataset enables researchers and practitioners to better understand the complexity of emergencies and develop more effective prevention, mitigation, and response strategies. The improvement in model performance also shows that multimodal methods are a promising direction for analyzing emergency events because it leverages the advantages of different types of data to achieve higher accuracy and reliability in classification tasks. The creation of MEED is a major advancement in the field of emergency management, providing researchers with a valuable resource and potentially leading to the development of more sophisticated tools for responding to emergencies. However, the dataset still has certain limitations. Over time, the number of emergencies on the Internet continues to grow, which requires us to continuously update the dataset to adapt to new situations. The size of the dataset largely determines the performance of the classification model. The class imbalance problem of the emergency dataset constructed in this paper needs to be solved. In future research, we will continue to update and maintain the dataset in a timely manner to address these issues.
format Article
id doaj-art-63e1d413a4474dfd96906da7e2dae068
institution DOAJ
issn 1002-1248
language zho
publishDate 2024-10-01
publisher Editorial Department of Journal of Library and Information Science in Agriculture
record_format Article
series Nongye tushu qingbao xuebao
spelling doaj-art-63e1d413a4474dfd96906da7e2dae0682025-08-20T03:05:53ZzhoEditorial Department of Journal of Library and Information Science in AgricultureNongye tushu qingbao xuebao1002-12482024-10-013610768510.13998/j.cnki.issn1002-1248.24-0624Construction of a Multimodal Dataset for Emergency Event Identification and ClassificationYifan ZHANG, Zuqin CHEN, Jike GE, Mingkun HE, Jie TAN0Department of Computer Science and Engineering, Chongqing University of Science and Technology, Chongqing 401331[Purpose/Significance] Rich Internet data provide a multi-dimensional perspective for understanding emergencies, and multimodal emergency classification methods have emerged. However, the existing multimodal datasets of emergencies are not only scarce, but also lacking in diversity in categories, which is not enough to support related research, and greatly affects the progress of subsequent research. Compared with previous public datasets, the dataset constructed in this paper has richer categories and more improved modalities. This dataset solves the key gaps in the availability and diversity of multimodal datasets of emergencies. It not only expands the category range, but also provides more detailed classification in the natural disaster category, which is crucial for developing robust and accurate multimodal classification models. [Method/Process] An emergency event dataset (MEED) based on multimodal information was constructed, which contains data from five categories: accident disasters, public health, social security, natural disasters, and non-emergency events. The natural disaster data are divided into seven subcategories: geological disasters, biological disasters, drought disasters, marine disasters, meteorological disasters, earthquake disasters, and forest and grassland fires. [Results/Conclusions] The existing emergency classification methods were analyzed and validated on the emergency public dataset and MEED. The results showed that MEED helped improve the performance of multimodal models by more than 10% compared with the currently available emergency datasets. The results show that the improvement in model performance highlights the value of MEED in promoting emergency management and response research and applications. The dataset enables researchers and practitioners to better understand the complexity of emergencies and develop more effective prevention, mitigation, and response strategies. The improvement in model performance also shows that multimodal methods are a promising direction for analyzing emergency events because it leverages the advantages of different types of data to achieve higher accuracy and reliability in classification tasks. The creation of MEED is a major advancement in the field of emergency management, providing researchers with a valuable resource and potentially leading to the development of more sophisticated tools for responding to emergencies. However, the dataset still has certain limitations. Over time, the number of emergencies on the Internet continues to grow, which requires us to continuously update the dataset to adapt to new situations. The size of the dataset largely determines the performance of the classification model. The class imbalance problem of the emergency dataset constructed in this paper needs to be solved. In future research, we will continue to update and maintain the dataset in a timely manner to address these issues.http://nytsqb.aiijournal.com/fileup/1002-1248/PDF/1741772376931-942972400.pdfincidents|multimodal|dataset|deep learning|data acquisition|data annotations
spellingShingle Yifan ZHANG, Zuqin CHEN, Jike GE, Mingkun HE, Jie TAN
Construction of a Multimodal Dataset for Emergency Event Identification and Classification
Nongye tushu qingbao xuebao
incidents|multimodal|dataset|deep learning|data acquisition|data annotations
title Construction of a Multimodal Dataset for Emergency Event Identification and Classification
title_full Construction of a Multimodal Dataset for Emergency Event Identification and Classification
title_fullStr Construction of a Multimodal Dataset for Emergency Event Identification and Classification
title_full_unstemmed Construction of a Multimodal Dataset for Emergency Event Identification and Classification
title_short Construction of a Multimodal Dataset for Emergency Event Identification and Classification
title_sort construction of a multimodal dataset for emergency event identification and classification
topic incidents|multimodal|dataset|deep learning|data acquisition|data annotations
url http://nytsqb.aiijournal.com/fileup/1002-1248/PDF/1741772376931-942972400.pdf
work_keys_str_mv AT yifanzhangzuqinchenjikegemingkunhejietan constructionofamultimodaldatasetforemergencyeventidentificationandclassification