Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease

Parkinson’s disease (PD) is a clinical neurodegenerative disease having symptoms like tremor, rigidity, and postural disability. According to Harvard, about 60,000 of American citizens are diagnosed with PD yearly, with more than 10 million people infected worldwide. An estimate of 4% of the people...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohamed Shalaby, Nahla A. Belal, Yasser Omar
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2021/3112771
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832560627610025984
author Mohamed Shalaby
Nahla A. Belal
Yasser Omar
author_facet Mohamed Shalaby
Nahla A. Belal
Yasser Omar
author_sort Mohamed Shalaby
collection DOAJ
description Parkinson’s disease (PD) is a clinical neurodegenerative disease having symptoms like tremor, rigidity, and postural disability. According to Harvard, about 60,000 of American citizens are diagnosed with PD yearly, with more than 10 million people infected worldwide. An estimate of 4% of the people have PD before they reach the age 50; however, the incident increases with age. Diagnosis of PD relies on the expertise of the physician and depends on several established clinical criteria. This makes the diagnosis subjective and inefficient. Hence, continuous efforts are being made to enhance the diagnosis of PD using deep learning approaches that rely on experienced neurologists. Siamese neural networks mainly work on two different input vectors and are used in comparison of output vectors. Moreover, clustering a dataset before applying classification enhances the distribution of similar samples among groups. In addition, applying the Siamese network can overcome the limitation of samples per class in the dataset by guiding the network to learn differences between samples rather than focusing on learning specific classes. In this paper, a Siamese neural network is applied to diagnose PD. Siamese networks predict the sample class by estimating how similar a sample is to other samples. The idea behind this work is clustering the dataset before training the network, as different pairs that belong to the same cluster are candidates to be mistaken by the network and assumed to be matched pairs. To overcome this problem, the dataset is first clustered, and then the architecture feeds the network to pairs of the same cluster. The proposed framework is concerned with comparing the performance when using clustered against unclustered data. The proposed framework outperforms the conventional framework without clustering. The accuracy achieved for classifying unclustered PD patients reached 76.75%, while it reached 84.02% for clustered data, outperforming the same technique on unclustered data. The significance of this study is in the enhanced performance achieved due to the clustering of data, which shows a promising framework to enhance the diagnostic capability of computer-aided disease diagnostic tools.
format Article
id doaj-art-d54070b2b14c4eb0a82700b2520681eb
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-d54070b2b14c4eb0a82700b2520681eb2025-02-03T01:27:07ZengWileyComplexity1076-27871099-05262021-01-01202110.1155/2021/31127713112771Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s DiseaseMohamed Shalaby0Nahla A. Belal1Yasser Omar2Egyptian Armed Forces, Cairo, EgyptCollege of Computing and Information Technology, Arab Academy for Science, Technology and Maritime Transport (AASTMT), Smart Village, Cairo, EgyptCollege of Computing and Information Technology, Arab Academy for Science, Technology and Maritime Transport (AASTMT), Smart Village, Cairo, EgyptParkinson’s disease (PD) is a clinical neurodegenerative disease having symptoms like tremor, rigidity, and postural disability. According to Harvard, about 60,000 of American citizens are diagnosed with PD yearly, with more than 10 million people infected worldwide. An estimate of 4% of the people have PD before they reach the age 50; however, the incident increases with age. Diagnosis of PD relies on the expertise of the physician and depends on several established clinical criteria. This makes the diagnosis subjective and inefficient. Hence, continuous efforts are being made to enhance the diagnosis of PD using deep learning approaches that rely on experienced neurologists. Siamese neural networks mainly work on two different input vectors and are used in comparison of output vectors. Moreover, clustering a dataset before applying classification enhances the distribution of similar samples among groups. In addition, applying the Siamese network can overcome the limitation of samples per class in the dataset by guiding the network to learn differences between samples rather than focusing on learning specific classes. In this paper, a Siamese neural network is applied to diagnose PD. Siamese networks predict the sample class by estimating how similar a sample is to other samples. The idea behind this work is clustering the dataset before training the network, as different pairs that belong to the same cluster are candidates to be mistaken by the network and assumed to be matched pairs. To overcome this problem, the dataset is first clustered, and then the architecture feeds the network to pairs of the same cluster. The proposed framework is concerned with comparing the performance when using clustered against unclustered data. The proposed framework outperforms the conventional framework without clustering. The accuracy achieved for classifying unclustered PD patients reached 76.75%, while it reached 84.02% for clustered data, outperforming the same technique on unclustered data. The significance of this study is in the enhanced performance achieved due to the clustering of data, which shows a promising framework to enhance the diagnostic capability of computer-aided disease diagnostic tools.http://dx.doi.org/10.1155/2021/3112771
spellingShingle Mohamed Shalaby
Nahla A. Belal
Yasser Omar
Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease
Complexity
title Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease
title_full Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease
title_fullStr Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease
title_full_unstemmed Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease
title_short Data Clustering Improves Siamese Neural Networks Classification of Parkinson’s Disease
title_sort data clustering improves siamese neural networks classification of parkinson s disease
url http://dx.doi.org/10.1155/2021/3112771
work_keys_str_mv AT mohamedshalaby dataclusteringimprovessiameseneuralnetworksclassificationofparkinsonsdisease
AT nahlaabelal dataclusteringimprovessiameseneuralnetworksclassificationofparkinsonsdisease
AT yasseromar dataclusteringimprovessiameseneuralnetworksclassificationofparkinsonsdisease