Modeling of injury severity of distracted driving accident using statistical and machine learning models.

Distracted Driving (DD) is one of the global causes of high mortality and fatality in road traffic accidents. The increase in the number of distracted driving accidents (DDAs) is one of the concerns among transportation communities. The present study aimed to examine the individual and interacted ef...

Full description

Saved in:
Bibliographic Details
Main Authors: Neero Gumsar Sorum, Martina Gumsar Sorum
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0326113
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849688153641713664
author Neero Gumsar Sorum
Martina Gumsar Sorum
author_facet Neero Gumsar Sorum
Martina Gumsar Sorum
author_sort Neero Gumsar Sorum
collection DOAJ
description Distracted Driving (DD) is one of the global causes of high mortality and fatality in road traffic accidents. The increase in the number of distracted driving accidents (DDAs) is one of the concerns among transportation communities. The present study aimed to examine the individual and interacted effects of the influential factors on the injury severity of the DDAs using the Binary Logistic Regression (BLR) method, and at the same, to select the best machine learning (ML) model in predicting the injury severity of the DDA. The selection of the best ML model was based on the optimum combination of accuracy, F1 score, and area under curve metrics. Ten years of DDA data (2011-2020) provided by the police department of Imphal, India, was used in the present study. The BLR model-without-interaction results revealed that out of twenty categorical variables, nine categorical variables (below 18, 18-24, 25-40, above 40 years age group, two-wheeler, heavy motor vehicle, 12AM-6AM, 6PM-12AM, and hit-object collision) were statistically significant to the injury severity of the DDAs. In interaction model results, there were 11, 1, and 1 significant combinations among categorical variables in two-way, three-way, and four-way interaction models, respectively. The ML model results showed that overall, the XGBoost model was reported as the best-performing model in the first hyperparameter set, and the Single Layer Perceptron model in the second set. These results may be useful for transportation policymakers while implementing any countermeasures to improve road safety in hilly areas.
format Article
id doaj-art-51da0f8db3cf42df802f972a1b5ba284
institution DOAJ
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-51da0f8db3cf42df802f972a1b5ba2842025-08-20T03:22:07ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01206e032611310.1371/journal.pone.0326113Modeling of injury severity of distracted driving accident using statistical and machine learning models.Neero Gumsar SorumMartina Gumsar SorumDistracted Driving (DD) is one of the global causes of high mortality and fatality in road traffic accidents. The increase in the number of distracted driving accidents (DDAs) is one of the concerns among transportation communities. The present study aimed to examine the individual and interacted effects of the influential factors on the injury severity of the DDAs using the Binary Logistic Regression (BLR) method, and at the same, to select the best machine learning (ML) model in predicting the injury severity of the DDA. The selection of the best ML model was based on the optimum combination of accuracy, F1 score, and area under curve metrics. Ten years of DDA data (2011-2020) provided by the police department of Imphal, India, was used in the present study. The BLR model-without-interaction results revealed that out of twenty categorical variables, nine categorical variables (below 18, 18-24, 25-40, above 40 years age group, two-wheeler, heavy motor vehicle, 12AM-6AM, 6PM-12AM, and hit-object collision) were statistically significant to the injury severity of the DDAs. In interaction model results, there were 11, 1, and 1 significant combinations among categorical variables in two-way, three-way, and four-way interaction models, respectively. The ML model results showed that overall, the XGBoost model was reported as the best-performing model in the first hyperparameter set, and the Single Layer Perceptron model in the second set. These results may be useful for transportation policymakers while implementing any countermeasures to improve road safety in hilly areas.https://doi.org/10.1371/journal.pone.0326113
spellingShingle Neero Gumsar Sorum
Martina Gumsar Sorum
Modeling of injury severity of distracted driving accident using statistical and machine learning models.
PLoS ONE
title Modeling of injury severity of distracted driving accident using statistical and machine learning models.
title_full Modeling of injury severity of distracted driving accident using statistical and machine learning models.
title_fullStr Modeling of injury severity of distracted driving accident using statistical and machine learning models.
title_full_unstemmed Modeling of injury severity of distracted driving accident using statistical and machine learning models.
title_short Modeling of injury severity of distracted driving accident using statistical and machine learning models.
title_sort modeling of injury severity of distracted driving accident using statistical and machine learning models
url https://doi.org/10.1371/journal.pone.0326113
work_keys_str_mv AT neerogumsarsorum modelingofinjuryseverityofdistracteddrivingaccidentusingstatisticalandmachinelearningmodels
AT martinagumsarsorum modelingofinjuryseverityofdistracteddrivingaccidentusingstatisticalandmachinelearningmodels