Vocal performance evaluation of the intelligent note recognition method based on deep learning

Abstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for opt...

Full description

Saved in:
Bibliographic Details
Main Author: Dongyun Chang
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-99357-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849713603250225152
author Dongyun Chang
author_facet Dongyun Chang
author_sort Dongyun Chang
collection DOAJ
description Abstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for optimization. Moreover, the attention mechanism is added to the optimized model to implement an intelligent note recognition model, and the results of note recognition are compared with those of common models. Finally, according to the results of audio signal classification, a vocal performance evaluation model optimized based on the attention mechanism is constructed. The accuracy of the model under different feature inputs is compared. The results indicate that different models show obvious differences in F-value, accuracy, precision, and recall. The attention mechanism-gated recurrent convolutional neural network (A-GRCNN) model performs best on all indicators. Specifically, this model’s accuracy, recall, F-value, and precision reach 0.961, 0.958, 0.963, and 0.970. The incorporation of multiple feature inputs can remarkably enhance the accuracy of vocal performance evaluation, especially the combination of constant Q Transform features, which is the most outstanding. This study improves the accuracy and reliability of music information processing, promotes the application of DL technology in music, and contributes to optimizing vocal performance evaluation.
format Article
id doaj-art-45ef8cd3021f4faeb05340f1e016cf6b
institution DOAJ
issn 2045-2322
language English
publishDate 2025-04-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-45ef8cd3021f4faeb05340f1e016cf6b2025-08-20T03:13:55ZengNature PortfolioScientific Reports2045-23222025-04-0115112110.1038/s41598-025-99357-2Vocal performance evaluation of the intelligent note recognition method based on deep learningDongyun Chang0School of Music, Qinghai Normal UniversityAbstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for optimization. Moreover, the attention mechanism is added to the optimized model to implement an intelligent note recognition model, and the results of note recognition are compared with those of common models. Finally, according to the results of audio signal classification, a vocal performance evaluation model optimized based on the attention mechanism is constructed. The accuracy of the model under different feature inputs is compared. The results indicate that different models show obvious differences in F-value, accuracy, precision, and recall. The attention mechanism-gated recurrent convolutional neural network (A-GRCNN) model performs best on all indicators. Specifically, this model’s accuracy, recall, F-value, and precision reach 0.961, 0.958, 0.963, and 0.970. The incorporation of multiple feature inputs can remarkably enhance the accuracy of vocal performance evaluation, especially the combination of constant Q Transform features, which is the most outstanding. This study improves the accuracy and reliability of music information processing, promotes the application of DL technology in music, and contributes to optimizing vocal performance evaluation.https://doi.org/10.1038/s41598-025-99357-2Deep learningNote recognitionVocal performance evaluationAttention mechanismNeural network
spellingShingle Dongyun Chang
Vocal performance evaluation of the intelligent note recognition method based on deep learning
Scientific Reports
Deep learning
Note recognition
Vocal performance evaluation
Attention mechanism
Neural network
title Vocal performance evaluation of the intelligent note recognition method based on deep learning
title_full Vocal performance evaluation of the intelligent note recognition method based on deep learning
title_fullStr Vocal performance evaluation of the intelligent note recognition method based on deep learning
title_full_unstemmed Vocal performance evaluation of the intelligent note recognition method based on deep learning
title_short Vocal performance evaluation of the intelligent note recognition method based on deep learning
title_sort vocal performance evaluation of the intelligent note recognition method based on deep learning
topic Deep learning
Note recognition
Vocal performance evaluation
Attention mechanism
Neural network
url https://doi.org/10.1038/s41598-025-99357-2
work_keys_str_mv AT dongyunchang vocalperformanceevaluationoftheintelligentnoterecognitionmethodbasedondeeplearning