Minimising latency of pitch detection algorithms for live vocals on low-cost hardware

A pitch estimation device was proposed for live vocals to output appropriate pitch data through the musical instrument digital interface (MIDI). The intention was to ideally achieve unnoticeable latency while maintaining estimation accuracy. The projected target platform was low-cost, standalone har...

Full description

Saved in:
Bibliographic Details
Format: Article
Language:English
Published: University of Huddersfield Press 2016-01-01
Series:Fields
Online Access:https://www.fieldsjournal.org.uk/article/id/445/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850285341740630016
collection DOAJ
description A pitch estimation device was proposed for live vocals to output appropriate pitch data through the musical instrument digital interface (MIDI). The intention was to ideally achieve unnoticeable latency while maintaining estimation accuracy. The projected target platform was low-cost, standalone hardware based around a microcontroller such as the Microchip PIC series. This study investigated, optimised and compared the performance of suitable algorithms for this application. Performance was determined by two key factors: accuracy and latency. Many papers have been published over the past six decades assessing and comparing the accuracy of pitch detection algorithms on various signals, including vocals. However, very little information is available concerning the latency of pitch detection algorithms and methods with which this can be minimised. Real-time audio introduces a further latency challenge that is sparsely studied, minimising the length of sampled audio required by the algorithms in order to reduce overall total latency. Thorough testing was undertaken in order to determine the best-performing algorithm and optimal parameter combination. Software modifications were implemented to facilitate accurate, repeatable, automated testing in order to build a comprehensive set of results encompassing a wide range of test conditions. The results revealed that the infinite-peak-clipping autocorrelation function (IACF) performed better than the other autocorrelation functions tested and also identified ideal parameter values or value ranges to provide the optimal latency/accuracy balance. Although the results were encouraging, testing highlighted some fundamental issues with vocal pitch detection. Potential solutions are proposed for further development.
format Article
id doaj-art-9201ad79ec94425faf7edef7a3eab236
institution OA Journals
issn 2057-0163
language English
publishDate 2016-01-01
publisher University of Huddersfield Press
record_format Article
series Fields
spelling doaj-art-9201ad79ec94425faf7edef7a3eab2362025-08-20T01:47:18ZengUniversity of Huddersfield PressFields2057-01632016-01-012110.5920/fields.2016.2125Minimising latency of pitch detection algorithms for live vocals on low-cost hardwareA pitch estimation device was proposed for live vocals to output appropriate pitch data through the musical instrument digital interface (MIDI). The intention was to ideally achieve unnoticeable latency while maintaining estimation accuracy. The projected target platform was low-cost, standalone hardware based around a microcontroller such as the Microchip PIC series. This study investigated, optimised and compared the performance of suitable algorithms for this application. Performance was determined by two key factors: accuracy and latency. Many papers have been published over the past six decades assessing and comparing the accuracy of pitch detection algorithms on various signals, including vocals. However, very little information is available concerning the latency of pitch detection algorithms and methods with which this can be minimised. Real-time audio introduces a further latency challenge that is sparsely studied, minimising the length of sampled audio required by the algorithms in order to reduce overall total latency. Thorough testing was undertaken in order to determine the best-performing algorithm and optimal parameter combination. Software modifications were implemented to facilitate accurate, repeatable, automated testing in order to build a comprehensive set of results encompassing a wide range of test conditions. The results revealed that the infinite-peak-clipping autocorrelation function (IACF) performed better than the other autocorrelation functions tested and also identified ideal parameter values or value ranges to provide the optimal latency/accuracy balance. Although the results were encouraging, testing highlighted some fundamental issues with vocal pitch detection. Potential solutions are proposed for further development.https://www.fieldsjournal.org.uk/article/id/445/
spellingShingle Minimising latency of pitch detection algorithms for live vocals on low-cost hardware
Fields
title Minimising latency of pitch detection algorithms for live vocals on low-cost hardware
title_full Minimising latency of pitch detection algorithms for live vocals on low-cost hardware
title_fullStr Minimising latency of pitch detection algorithms for live vocals on low-cost hardware
title_full_unstemmed Minimising latency of pitch detection algorithms for live vocals on low-cost hardware
title_short Minimising latency of pitch detection algorithms for live vocals on low-cost hardware
title_sort minimising latency of pitch detection algorithms for live vocals on low cost hardware
url https://www.fieldsjournal.org.uk/article/id/445/