Casualty Analysis of the Drivers in Traffic Accidents in Turkey: A CHAID Decision Tree Model

The number of traffic accidents in a region rises as the vehicle–km value in traffic increases. Furthermore, since automobiles make up the highest proportion of vehicles in traffic, they represent the greatest weight in traffic accidents. This study aims to establish a model to predict the driver’s...

Full description

Saved in:
Bibliographic Details
Main Authors: Zeliha Cagla Kuyumcu, Hakan Aslan, Nilufer Yurtay
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/24/11693
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The number of traffic accidents in a region rises as the vehicle–km value in traffic increases. Furthermore, since automobiles make up the highest proportion of vehicles in traffic, they represent the greatest weight in traffic accidents. This study aims to establish a model to predict the driver’s status (survived–injured–dead) as a result of the fatal-injury type of accident. The size of the vehicles suppresses the direct factors related to drivers by having a significant and dominant effect on the analysis of the results of the accidents by concealing the other important factors which must be taken into consideration with regard to the casualty levels of the drivers. Consequently, this paper focuses on automobiles, which are the most frequently involved vehicle type in accidents. Furthermore, the dataset representing the accidents that occurred in Turkey between 2015 and 2021 was employed for the analysis of the effects of the attributes of the drivers on the outcome of casualties for automobile-related accidents alone. The uniqueness of this research stems from being the first study in Turkey to investigate the severity levels of the drivers involved in automobile-related accidents. In addition, this study highlights the preventable factors investigated relatively less than other factors in the literature in order to establish a successful model. The difference between the success of the models with regard to accuracy obtained through dominant and investigated factors is only 5.0%. Random Forests, Naïve Bayes, and CHAID (Chi-squared Automatic Interaction Detection) models were established and compared as decision tree algorithms. The results revealed the fact that the CHAID model produced the most successful outcomes among them. Driver fault, gender, education level, and age, along with alcohol usage and surface condition, were found to be significant influential factors for the severity of traffic accidents.
ISSN:2076-3417