Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation

The main purpose of this work was to investigate the possibility of detecting respiratory diseases in audio recordings of lung auscultation using modern deep learning tools, as well as to explore the possibility of using data augmentation by generating synthetic spectral representations of audio sam...

Full description

Saved in:
Bibliographic Details
Main Authors: D.V. Panaskin, S.H. Stirenko, D.S. Babko
Format: Article
Language:English
Published: Dnipro State Medical University 2024-10-01
Series:Medičnì Perspektivi
Subjects:
Online Access:https://journals.uran.ua/index.php/2307-0404/article/view/313569
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832557943083499520
author D.V. Panaskin
S.H. Stirenko
D.S. Babko
author_facet D.V. Panaskin
S.H. Stirenko
D.S. Babko
author_sort D.V. Panaskin
collection DOAJ
description The main purpose of this work was to investigate the possibility of detecting respiratory diseases in audio recordings of lung auscultation using modern deep learning tools, as well as to explore the possibility of using data augmentation by generating synthetic spectral representations of audio samples. The ICBHI (International Conference on Biomedical and Health Informatics) dataset was used for training, validation and augmentation. The dataset includes lung auscultations of 126 different subjects, there are a total of 920 sounds, of which 810 have signs of chronic diseases, 75 of non-chronic diseases and 35 with no pathology. The stage of data preprocessing includes discretization to 4kHz frequency, as well as filtering of frequency bands that do not carry information value for the task. In the next step, each sample was transformed into a frequency spectrum and Melspectrograms were generated. To solve the problem of class imbalance, the required number of synthetic spectrograms generated by convolutional variation autoencoders was added. At the stage of building the model, the methods of classical convolutional neural networks were used. The quality of the obtained algorithm was evaluated using a 10-fold cross-validation. Also, to assess the generalization of the proposed method, experiments were performed with the split of audio recordings into training and test sets using patient grouping. Qualitative evaluation of the model was performed using sensitivity, specificity, F1-score and Cohen’s kappa. A score of 98.45% F1-score was achieved for the 5-class classification problem which can contribute to the development of ways to synthesize and augment sensitive medical data. In addition, a cons of existing methods in the generalization of the obtained predictions were revealed, which opens the way for further research in the direction of clinical respiratory diseases detection.
format Article
id doaj-art-0fc63349b4e04fb68330339a6361f679
institution Kabale University
issn 2307-0404
language English
publishDate 2024-10-01
publisher Dnipro State Medical University
record_format Article
series Medičnì Perspektivi
spelling doaj-art-0fc63349b4e04fb68330339a6361f6792025-02-03T01:40:33ZengDnipro State Medical UniversityMedičnì Perspektivi2307-04042024-10-012939610710.26641/2307-0404.2024.3.313569352107Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentationD.V. Panaskin0https://orcid.org/0009-0008-2867-5399S.H. Stirenko1https://orcid.org/0009-0003-2674-3590D.S. Babko2https://orcid.org/0009-0006-5829-1906Ihor Sikorsky Kyiv polytechnic institute, Peremohy ave., 37, Kyiv, 03056Ihor Sikorsky Kyiv polytechnic institute, Peremohy ave., 37, Kyiv, 03056Ihor Sikorsky Kyiv polytechnic institute, Peremohy ave., 37, Kyiv, 03056The main purpose of this work was to investigate the possibility of detecting respiratory diseases in audio recordings of lung auscultation using modern deep learning tools, as well as to explore the possibility of using data augmentation by generating synthetic spectral representations of audio samples. The ICBHI (International Conference on Biomedical and Health Informatics) dataset was used for training, validation and augmentation. The dataset includes lung auscultations of 126 different subjects, there are a total of 920 sounds, of which 810 have signs of chronic diseases, 75 of non-chronic diseases and 35 with no pathology. The stage of data preprocessing includes discretization to 4kHz frequency, as well as filtering of frequency bands that do not carry information value for the task. In the next step, each sample was transformed into a frequency spectrum and Melspectrograms were generated. To solve the problem of class imbalance, the required number of synthetic spectrograms generated by convolutional variation autoencoders was added. At the stage of building the model, the methods of classical convolutional neural networks were used. The quality of the obtained algorithm was evaluated using a 10-fold cross-validation. Also, to assess the generalization of the proposed method, experiments were performed with the split of audio recordings into training and test sets using patient grouping. Qualitative evaluation of the model was performed using sensitivity, specificity, F1-score and Cohen’s kappa. A score of 98.45% F1-score was achieved for the 5-class classification problem which can contribute to the development of ways to synthesize and augment sensitive medical data. In addition, a cons of existing methods in the generalization of the obtained predictions were revealed, which opens the way for further research in the direction of clinical respiratory diseases detection.https://journals.uran.ua/index.php/2307-0404/article/view/313569pulmonary diseaseslung soundsdeep learningconvolutional neural networkconditional variational auto encoder
spellingShingle D.V. Panaskin
S.H. Stirenko
D.S. Babko
Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation
Medičnì Perspektivi
pulmonary diseases
lung sounds
deep learning
convolutional neural network
conditional variational auto encoder
title Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation
title_full Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation
title_fullStr Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation
title_full_unstemmed Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation
title_short Respiratory disease detection in lung auscultation with convolutional neural networks and CVAE augmentation
title_sort respiratory disease detection in lung auscultation with convolutional neural networks and cvae augmentation
topic pulmonary diseases
lung sounds
deep learning
convolutional neural network
conditional variational auto encoder
url https://journals.uran.ua/index.php/2307-0404/article/view/313569
work_keys_str_mv AT dvpanaskin respiratorydiseasedetectioninlungauscultationwithconvolutionalneuralnetworksandcvaeaugmentation
AT shstirenko respiratorydiseasedetectioninlungauscultationwithconvolutionalneuralnetworksandcvaeaugmentation
AT dsbabko respiratorydiseasedetectioninlungauscultationwithconvolutionalneuralnetworksandcvaeaugmentation