Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach

Anomalous chromosomes are the cause of genetic diseases such as cancer, Alzheimer's, Parkinson's, epilepsy, and autism. Karyotype analysis is the standard procedure for diagnosing genetic disorders. Identifying anomalies is often costly, time-consuming, heavily reliant on expert interpreta...

Full description

Saved in:
Bibliographic Details
Main Authors: Sumaira Tabassum, M. Jawad Khan, Javaid Iqbal, Asim Waris, M. Adeel Ijaz
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Computational Neuroscience
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fncom.2024.1525895/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832591668334821376
author Sumaira Tabassum
M. Jawad Khan
Javaid Iqbal
Asim Waris
M. Adeel Ijaz
author_facet Sumaira Tabassum
M. Jawad Khan
Javaid Iqbal
Asim Waris
M. Adeel Ijaz
author_sort Sumaira Tabassum
collection DOAJ
description Anomalous chromosomes are the cause of genetic diseases such as cancer, Alzheimer's, Parkinson's, epilepsy, and autism. Karyotype analysis is the standard procedure for diagnosing genetic disorders. Identifying anomalies is often costly, time-consuming, heavily reliant on expert interpretation, and requires considerable manual effort. Efforts are being made to automate karyogram analysis. However, the unavailability of large datasets, particularly those including samples with chromosomal abnormalities, presents a significant challenge. The development of automated models requires extensive labeled and incredibly abnormal data to accurately identify and analyze abnormalities, which are difficult to obtain in sufficient quantities. Although the deep learning-based architecture has yielded state-of-the-art performance in medical image anomaly detection, it cannot be generalized well because of the lack of anomalous datasets. This study introduces a novel hybrid approach that combines unsupervised and supervised learning techniques to overcome the challenges of limited labeled data and scalability in chromosomal analysis. An Autoencoder-based system is initially trained with unlabeled data to identify chromosome patterns. It is fine-tuned on labeled data, followed by a classification step using a Convolutional Neural Network (CNN). A unique dataset of 234,259 chromosome images, including the training, validation, and test sets, was used. Marking a significant achievement in the scale of chromosomal analysis. The proposed hybrid system accurately detects structural anomalies in individual chromosome images, achieving 99.3% accuracy in classifying normal and abnormal chromosomes. We also used a structural similarity index measure and template matching to identify the part of the abnormal chromosome that differed from the normal one. This automated model has the potential to significantly contribute to the early detection and diagnosis of chromosome-related disorders that affect both genetic health and neurological behavior.
format Article
id doaj-art-f131fd692c374e58843b0341a4cd8c3e
institution Kabale University
issn 1662-5188
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Computational Neuroscience
spelling doaj-art-f131fd692c374e58843b0341a4cd8c3e2025-01-22T07:15:26ZengFrontiers Media S.A.Frontiers in Computational Neuroscience1662-51882025-01-011810.3389/fncom.2024.15258951525895Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approachSumaira TabassumM. Jawad KhanJavaid IqbalAsim WarisM. Adeel IjazAnomalous chromosomes are the cause of genetic diseases such as cancer, Alzheimer's, Parkinson's, epilepsy, and autism. Karyotype analysis is the standard procedure for diagnosing genetic disorders. Identifying anomalies is often costly, time-consuming, heavily reliant on expert interpretation, and requires considerable manual effort. Efforts are being made to automate karyogram analysis. However, the unavailability of large datasets, particularly those including samples with chromosomal abnormalities, presents a significant challenge. The development of automated models requires extensive labeled and incredibly abnormal data to accurately identify and analyze abnormalities, which are difficult to obtain in sufficient quantities. Although the deep learning-based architecture has yielded state-of-the-art performance in medical image anomaly detection, it cannot be generalized well because of the lack of anomalous datasets. This study introduces a novel hybrid approach that combines unsupervised and supervised learning techniques to overcome the challenges of limited labeled data and scalability in chromosomal analysis. An Autoencoder-based system is initially trained with unlabeled data to identify chromosome patterns. It is fine-tuned on labeled data, followed by a classification step using a Convolutional Neural Network (CNN). A unique dataset of 234,259 chromosome images, including the training, validation, and test sets, was used. Marking a significant achievement in the scale of chromosomal analysis. The proposed hybrid system accurately detects structural anomalies in individual chromosome images, achieving 99.3% accuracy in classifying normal and abnormal chromosomes. We also used a structural similarity index measure and template matching to identify the part of the abnormal chromosome that differed from the normal one. This automated model has the potential to significantly contribute to the early detection and diagnosis of chromosome-related disorders that affect both genetic health and neurological behavior.https://www.frontiersin.org/articles/10.3389/fncom.2024.1525895/fullchromosome anomaliescognitive sciencesmachine learningneurological healthneurodevelopmental disordersneurological disorders
spellingShingle Sumaira Tabassum
M. Jawad Khan
Javaid Iqbal
Asim Waris
M. Adeel Ijaz
Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
Frontiers in Computational Neuroscience
chromosome anomalies
cognitive sciences
machine learning
neurological health
neurodevelopmental disorders
neurological disorders
title Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
title_full Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
title_fullStr Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
title_full_unstemmed Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
title_short Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
title_sort automated karyogram analysis for early detection of genetic and neurodegenerative disorders a hybrid machine learning approach
topic chromosome anomalies
cognitive sciences
machine learning
neurological health
neurodevelopmental disorders
neurological disorders
url https://www.frontiersin.org/articles/10.3389/fncom.2024.1525895/full
work_keys_str_mv AT sumairatabassum automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach
AT mjawadkhan automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach
AT javaidiqbal automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach
AT asimwaris automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach
AT madeelijaz automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach