Vocal performance evaluation of the intelligent note recognition method based on deep learning
Abstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for opt...
Saved in:
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-04-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-99357-2 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849713603250225152 |
|---|---|
| author | Dongyun Chang |
| author_facet | Dongyun Chang |
| author_sort | Dongyun Chang |
| collection | DOAJ |
| description | Abstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for optimization. Moreover, the attention mechanism is added to the optimized model to implement an intelligent note recognition model, and the results of note recognition are compared with those of common models. Finally, according to the results of audio signal classification, a vocal performance evaluation model optimized based on the attention mechanism is constructed. The accuracy of the model under different feature inputs is compared. The results indicate that different models show obvious differences in F-value, accuracy, precision, and recall. The attention mechanism-gated recurrent convolutional neural network (A-GRCNN) model performs best on all indicators. Specifically, this model’s accuracy, recall, F-value, and precision reach 0.961, 0.958, 0.963, and 0.970. The incorporation of multiple feature inputs can remarkably enhance the accuracy of vocal performance evaluation, especially the combination of constant Q Transform features, which is the most outstanding. This study improves the accuracy and reliability of music information processing, promotes the application of DL technology in music, and contributes to optimizing vocal performance evaluation. |
| format | Article |
| id | doaj-art-45ef8cd3021f4faeb05340f1e016cf6b |
| institution | DOAJ |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-04-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-45ef8cd3021f4faeb05340f1e016cf6b2025-08-20T03:13:55ZengNature PortfolioScientific Reports2045-23222025-04-0115112110.1038/s41598-025-99357-2Vocal performance evaluation of the intelligent note recognition method based on deep learningDongyun Chang0School of Music, Qinghai Normal UniversityAbstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for optimization. Moreover, the attention mechanism is added to the optimized model to implement an intelligent note recognition model, and the results of note recognition are compared with those of common models. Finally, according to the results of audio signal classification, a vocal performance evaluation model optimized based on the attention mechanism is constructed. The accuracy of the model under different feature inputs is compared. The results indicate that different models show obvious differences in F-value, accuracy, precision, and recall. The attention mechanism-gated recurrent convolutional neural network (A-GRCNN) model performs best on all indicators. Specifically, this model’s accuracy, recall, F-value, and precision reach 0.961, 0.958, 0.963, and 0.970. The incorporation of multiple feature inputs can remarkably enhance the accuracy of vocal performance evaluation, especially the combination of constant Q Transform features, which is the most outstanding. This study improves the accuracy and reliability of music information processing, promotes the application of DL technology in music, and contributes to optimizing vocal performance evaluation.https://doi.org/10.1038/s41598-025-99357-2Deep learningNote recognitionVocal performance evaluationAttention mechanismNeural network |
| spellingShingle | Dongyun Chang Vocal performance evaluation of the intelligent note recognition method based on deep learning Scientific Reports Deep learning Note recognition Vocal performance evaluation Attention mechanism Neural network |
| title | Vocal performance evaluation of the intelligent note recognition method based on deep learning |
| title_full | Vocal performance evaluation of the intelligent note recognition method based on deep learning |
| title_fullStr | Vocal performance evaluation of the intelligent note recognition method based on deep learning |
| title_full_unstemmed | Vocal performance evaluation of the intelligent note recognition method based on deep learning |
| title_short | Vocal performance evaluation of the intelligent note recognition method based on deep learning |
| title_sort | vocal performance evaluation of the intelligent note recognition method based on deep learning |
| topic | Deep learning Note recognition Vocal performance evaluation Attention mechanism Neural network |
| url | https://doi.org/10.1038/s41598-025-99357-2 |
| work_keys_str_mv | AT dongyunchang vocalperformanceevaluationoftheintelligentnoterecognitionmethodbasedondeeplearning |