Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach
Anomalous chromosomes are the cause of genetic diseases such as cancer, Alzheimer's, Parkinson's, epilepsy, and autism. Karyotype analysis is the standard procedure for diagnosing genetic disorders. Identifying anomalies is often costly, time-consuming, heavily reliant on expert interpreta...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-01-01
|
Series: | Frontiers in Computational Neuroscience |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fncom.2024.1525895/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832591668334821376 |
---|---|
author | Sumaira Tabassum M. Jawad Khan Javaid Iqbal Asim Waris M. Adeel Ijaz |
author_facet | Sumaira Tabassum M. Jawad Khan Javaid Iqbal Asim Waris M. Adeel Ijaz |
author_sort | Sumaira Tabassum |
collection | DOAJ |
description | Anomalous chromosomes are the cause of genetic diseases such as cancer, Alzheimer's, Parkinson's, epilepsy, and autism. Karyotype analysis is the standard procedure for diagnosing genetic disorders. Identifying anomalies is often costly, time-consuming, heavily reliant on expert interpretation, and requires considerable manual effort. Efforts are being made to automate karyogram analysis. However, the unavailability of large datasets, particularly those including samples with chromosomal abnormalities, presents a significant challenge. The development of automated models requires extensive labeled and incredibly abnormal data to accurately identify and analyze abnormalities, which are difficult to obtain in sufficient quantities. Although the deep learning-based architecture has yielded state-of-the-art performance in medical image anomaly detection, it cannot be generalized well because of the lack of anomalous datasets. This study introduces a novel hybrid approach that combines unsupervised and supervised learning techniques to overcome the challenges of limited labeled data and scalability in chromosomal analysis. An Autoencoder-based system is initially trained with unlabeled data to identify chromosome patterns. It is fine-tuned on labeled data, followed by a classification step using a Convolutional Neural Network (CNN). A unique dataset of 234,259 chromosome images, including the training, validation, and test sets, was used. Marking a significant achievement in the scale of chromosomal analysis. The proposed hybrid system accurately detects structural anomalies in individual chromosome images, achieving 99.3% accuracy in classifying normal and abnormal chromosomes. We also used a structural similarity index measure and template matching to identify the part of the abnormal chromosome that differed from the normal one. This automated model has the potential to significantly contribute to the early detection and diagnosis of chromosome-related disorders that affect both genetic health and neurological behavior. |
format | Article |
id | doaj-art-f131fd692c374e58843b0341a4cd8c3e |
institution | Kabale University |
issn | 1662-5188 |
language | English |
publishDate | 2025-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Computational Neuroscience |
spelling | doaj-art-f131fd692c374e58843b0341a4cd8c3e2025-01-22T07:15:26ZengFrontiers Media S.A.Frontiers in Computational Neuroscience1662-51882025-01-011810.3389/fncom.2024.15258951525895Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approachSumaira TabassumM. Jawad KhanJavaid IqbalAsim WarisM. Adeel IjazAnomalous chromosomes are the cause of genetic diseases such as cancer, Alzheimer's, Parkinson's, epilepsy, and autism. Karyotype analysis is the standard procedure for diagnosing genetic disorders. Identifying anomalies is often costly, time-consuming, heavily reliant on expert interpretation, and requires considerable manual effort. Efforts are being made to automate karyogram analysis. However, the unavailability of large datasets, particularly those including samples with chromosomal abnormalities, presents a significant challenge. The development of automated models requires extensive labeled and incredibly abnormal data to accurately identify and analyze abnormalities, which are difficult to obtain in sufficient quantities. Although the deep learning-based architecture has yielded state-of-the-art performance in medical image anomaly detection, it cannot be generalized well because of the lack of anomalous datasets. This study introduces a novel hybrid approach that combines unsupervised and supervised learning techniques to overcome the challenges of limited labeled data and scalability in chromosomal analysis. An Autoencoder-based system is initially trained with unlabeled data to identify chromosome patterns. It is fine-tuned on labeled data, followed by a classification step using a Convolutional Neural Network (CNN). A unique dataset of 234,259 chromosome images, including the training, validation, and test sets, was used. Marking a significant achievement in the scale of chromosomal analysis. The proposed hybrid system accurately detects structural anomalies in individual chromosome images, achieving 99.3% accuracy in classifying normal and abnormal chromosomes. We also used a structural similarity index measure and template matching to identify the part of the abnormal chromosome that differed from the normal one. This automated model has the potential to significantly contribute to the early detection and diagnosis of chromosome-related disorders that affect both genetic health and neurological behavior.https://www.frontiersin.org/articles/10.3389/fncom.2024.1525895/fullchromosome anomaliescognitive sciencesmachine learningneurological healthneurodevelopmental disordersneurological disorders |
spellingShingle | Sumaira Tabassum M. Jawad Khan Javaid Iqbal Asim Waris M. Adeel Ijaz Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach Frontiers in Computational Neuroscience chromosome anomalies cognitive sciences machine learning neurological health neurodevelopmental disorders neurological disorders |
title | Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach |
title_full | Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach |
title_fullStr | Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach |
title_full_unstemmed | Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach |
title_short | Automated karyogram analysis for early detection of genetic and neurodegenerative disorders: a hybrid machine learning approach |
title_sort | automated karyogram analysis for early detection of genetic and neurodegenerative disorders a hybrid machine learning approach |
topic | chromosome anomalies cognitive sciences machine learning neurological health neurodevelopmental disorders neurological disorders |
url | https://www.frontiersin.org/articles/10.3389/fncom.2024.1525895/full |
work_keys_str_mv | AT sumairatabassum automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach AT mjawadkhan automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach AT javaidiqbal automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach AT asimwaris automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach AT madeelijaz automatedkaryogramanalysisforearlydetectionofgeneticandneurodegenerativedisordersahybridmachinelearningapproach |