Integrating deep learning and transfer learning: optimizing white blood cells classification in medical educational institutions

Abstract White blood cells (WBCs) play a crucial role in the immune system, protecting the body from infections and foreign invaders. Abnormalities in WBCs can be indicative of various conditions, including leukemia. WBCs classification is pivotal for diagnosing hematological disorders. This study a...

Full description

Saved in:
Bibliographic Details
Main Authors: M. Hussein, Faten Abd El-Sattar Zahran El-Mougi
Format: Article
Language:English
Published: SpringerOpen 2025-07-01
Series:Journal of Big Data
Subjects:
Online Access:https://doi.org/10.1186/s40537-025-01235-1
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract White blood cells (WBCs) play a crucial role in the immune system, protecting the body from infections and foreign invaders. Abnormalities in WBCs can be indicative of various conditions, including leukemia. WBCs classification is pivotal for diagnosing hematological disorders. This study advances automated WBCs analysis through an 8-class classification framework encompassing rare but clinically critical subtypes: neutrophils, eosinophils, basophils, lymphocytes, monocytes, immature granulocytes (IGs), erythroblasts, and platelets. Leveraging a dataset of 17,092 CellaVision DM96-generated images standardized for clinical relevance, we implement rigorous preprocessing (normalization, resizing) and dynamic augmentation (rotations, flips) to enhance robustness. Six architectures are evaluated: ResNet50, InceptionV3, EfficientNetB3, MobileNetV3, Swin Transformer, and a custom convolutional neural network (CNN). ResNet50 emerged as the top performer 98.83% accuracy, followed by InceptionV3 98.77% and Swin Transformer 98.71%, demonstrating the efficacy of transfer learning and transformer-based attention mechanisms. Class-weighted loss mitigated dataset imbalance, achieving > 0.98 F1-scores for 6/8 classes. Computational efficiency analysis revealed MobileNetV3 as optimal for deployment (3.43 ms/inference). The study addresses key challenges—class imbalance, model interpretability via Gradient-weighted Class Activation Mapping (Grad-CAM) visualizations—and validates improved diagnostic precision over prior work. By integrating clinically critical subtypes and state-of-the-art architectures, it provides a robust tool for medical education and practice, enabling early detection of leukemia, sepsis, and myelodysplastic syndromes. This study can enhance the training of medical students and doctors, equipping them with better tools for diagnosis and decision-making. Furthermore, the ability to classify a broader range of WBCs types could lead to more accurate and early diagnoses of diseases, ultimately improving patient care.
ISSN:2196-1115