Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries

<b>Background/Objectives</b>: Clavicle injuries are common and seem to be frequently subject to diagnostic misclassification. The accurate identification of clavicle fractures is essential, particularly for registry and Big Data analyses. This study aims to assess the frequency of diagno...

Full description

Saved in:
Bibliographic Details
Main Authors: Robert Raché, Lara-Sophie Claudé, Marcus Vollmer, Lyubomir Haralambiev, Denis Gümbel, Axel Ekkernkamp, Martin Jordan, Stefan Schulz-Drost, Mustafa Sinan Bakir
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Diagnostics
Subjects:
Online Access:https://www.mdpi.com/2075-4418/15/2/131
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588730250035200
author Robert Raché
Lara-Sophie Claudé
Marcus Vollmer
Lyubomir Haralambiev
Denis Gümbel
Axel Ekkernkamp
Martin Jordan
Stefan Schulz-Drost
Mustafa Sinan Bakir
author_facet Robert Raché
Lara-Sophie Claudé
Marcus Vollmer
Lyubomir Haralambiev
Denis Gümbel
Axel Ekkernkamp
Martin Jordan
Stefan Schulz-Drost
Mustafa Sinan Bakir
author_sort Robert Raché
collection DOAJ
description <b>Background/Objectives</b>: Clavicle injuries are common and seem to be frequently subject to diagnostic misclassification. The accurate identification of clavicle fractures is essential, particularly for registry and Big Data analyses. This study aims to assess the frequency of diagnostic errors in clavicle injury classifications. <b>Methods</b>: This retrospective study analyzed patient data from two Level 1 trauma centers, covering the period from 2008 to 2019. Included were cases with ICD-coded diagnoses of medial, midshaft, and lateral clavicle fractures, as well as sternoclavicular and acromioclavicular joint dislocations. Radiological images were re-evaluated, and discharge summaries, radiological reports, and billing codes were examined for diagnostic accuracy. <b>Results</b>: A total of 1503 patients were included, accounting for 1855 initial injury diagnoses. In contrast, 1846 were detected upon review. Initially, 14.4% of cases were coded as medial clavicle fractures, whereas only 5.2% were confirmed. The misclassification rate was 82.8% for initial medial fractures (<i>p</i> < 0.001), 42.5% for midshaft fractures (<i>p</i> < 0.001), and 34.2% for lateral fractures (<i>p</i> < 0.001). Billing codes and discharge summaries were the most error-prone categories, with error rates of 64% and 36% of all misclassified cases, respectively. Over three-quarters of the cases with discharge summary errors also exhibited errors in other categories, while billing errors co-occurred with other category errors in just over half of the cases (<i>p</i> < 0.001). The likelihood of radiological diagnostic error increased with the number of imaging modalities used, from 19.7% with a single modality to 30.5% with two and 40.7% with three. <b>Conclusions</b>: Our findings indicate that diagnostic misclassification of clavicle fractures is common, particularly between medial and midshaft fractures, often resulting from errors in multiple categories. Further prospective studies are needed, as accurate classification is foundational for the reliable application of Big Data and AI-based analyses in clinical research.
format Article
id doaj-art-d6762ce7739a48369646dd051d6e39f0
institution Kabale University
issn 2075-4418
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Diagnostics
spelling doaj-art-d6762ce7739a48369646dd051d6e39f02025-01-24T13:28:50ZengMDPI AGDiagnostics2075-44182025-01-0115213110.3390/diagnostics15020131Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle InjuriesRobert Raché0Lara-Sophie Claudé1Marcus Vollmer2Lyubomir Haralambiev3Denis Gümbel4Axel Ekkernkamp5Martin Jordan6Stefan Schulz-Drost7Mustafa Sinan Bakir8Department of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, GermanyDepartment of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, GermanyInstitute of Bioinformatics, University Medicine Greifswald, Felix-Hausdorff-Str. 8, 17475 Greifswald, GermanyDepartment of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, GermanyDepartment of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, GermanyDepartment of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, GermanyDepartment of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, GermanyDepartment of Trauma and Orthopedic Surgery, University Hospital Erlangen, Krankenhausstr. 12, 91054 Erlangen, GermanyDepartment of Orthopedics, Trauma Surgery and Rehabilitative Medicine, University Medicine Greifswald, Ferdinand-Sauerbruch-Straße, 17471 Greifswald, Germany<b>Background/Objectives</b>: Clavicle injuries are common and seem to be frequently subject to diagnostic misclassification. The accurate identification of clavicle fractures is essential, particularly for registry and Big Data analyses. This study aims to assess the frequency of diagnostic errors in clavicle injury classifications. <b>Methods</b>: This retrospective study analyzed patient data from two Level 1 trauma centers, covering the period from 2008 to 2019. Included were cases with ICD-coded diagnoses of medial, midshaft, and lateral clavicle fractures, as well as sternoclavicular and acromioclavicular joint dislocations. Radiological images were re-evaluated, and discharge summaries, radiological reports, and billing codes were examined for diagnostic accuracy. <b>Results</b>: A total of 1503 patients were included, accounting for 1855 initial injury diagnoses. In contrast, 1846 were detected upon review. Initially, 14.4% of cases were coded as medial clavicle fractures, whereas only 5.2% were confirmed. The misclassification rate was 82.8% for initial medial fractures (<i>p</i> < 0.001), 42.5% for midshaft fractures (<i>p</i> < 0.001), and 34.2% for lateral fractures (<i>p</i> < 0.001). Billing codes and discharge summaries were the most error-prone categories, with error rates of 64% and 36% of all misclassified cases, respectively. Over three-quarters of the cases with discharge summary errors also exhibited errors in other categories, while billing errors co-occurred with other category errors in just over half of the cases (<i>p</i> < 0.001). The likelihood of radiological diagnostic error increased with the number of imaging modalities used, from 19.7% with a single modality to 30.5% with two and 40.7% with three. <b>Conclusions</b>: Our findings indicate that diagnostic misclassification of clavicle fractures is common, particularly between medial and midshaft fractures, often resulting from errors in multiple categories. Further prospective studies are needed, as accurate classification is foundational for the reliable application of Big Data and AI-based analyses in clinical research.https://www.mdpi.com/2075-4418/15/2/131big data analysesAI-based analysesmisclassificationclavicle injuriesclavicle fractures
spellingShingle Robert Raché
Lara-Sophie Claudé
Marcus Vollmer
Lyubomir Haralambiev
Denis Gümbel
Axel Ekkernkamp
Martin Jordan
Stefan Schulz-Drost
Mustafa Sinan Bakir
Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries
Diagnostics
big data analyses
AI-based analyses
misclassification
clavicle injuries
clavicle fractures
title Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries
title_full Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries
title_fullStr Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries
title_full_unstemmed Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries
title_short Erroneous Classification and Coding as a Limitation for Big Data Analyses: Causes and Impacts Illustrated by the Diagnosis of Clavicle Injuries
title_sort erroneous classification and coding as a limitation for big data analyses causes and impacts illustrated by the diagnosis of clavicle injuries
topic big data analyses
AI-based analyses
misclassification
clavicle injuries
clavicle fractures
url https://www.mdpi.com/2075-4418/15/2/131
work_keys_str_mv AT robertrache erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT larasophieclaude erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT marcusvollmer erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT lyubomirharalambiev erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT denisgumbel erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT axelekkernkamp erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT martinjordan erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT stefanschulzdrost erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries
AT mustafasinanbakir erroneousclassificationandcodingasalimitationforbigdataanalysescausesandimpactsillustratedbythediagnosisofclavicleinjuries