Vocal performance evaluation of the intelligent note recognition method based on deep learning

Abstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for opt...

Full description

Saved in:
Bibliographic Details
Main Author: Dongyun Chang
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-99357-2
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract This study aims to optimize the ability of note recognition and improve the accuracy of vocal performance evaluation. Firstly, the basic theory of music is analyzed. Secondly, the convolutional neural network (CNN) in deep learning (DL) is selected to integrate gated recurrent units for optimization. Moreover, the attention mechanism is added to the optimized model to implement an intelligent note recognition model, and the results of note recognition are compared with those of common models. Finally, according to the results of audio signal classification, a vocal performance evaluation model optimized based on the attention mechanism is constructed. The accuracy of the model under different feature inputs is compared. The results indicate that different models show obvious differences in F-value, accuracy, precision, and recall. The attention mechanism-gated recurrent convolutional neural network (A-GRCNN) model performs best on all indicators. Specifically, this model’s accuracy, recall, F-value, and precision reach 0.961, 0.958, 0.963, and 0.970. The incorporation of multiple feature inputs can remarkably enhance the accuracy of vocal performance evaluation, especially the combination of constant Q Transform features, which is the most outstanding. This study improves the accuracy and reliability of music information processing, promotes the application of DL technology in music, and contributes to optimizing vocal performance evaluation.
ISSN:2045-2322