The Laryngovibrogram as a normalized spatiotemporal representation of vocal fold dynamics
Abstract: Laryngeal high-speed video (HSV)-endoscopy allows for fast, non-invasive diagnosis of voice disorders and forms the basis for a comprehensive quantitative analysis of the vocal folds’ (VFs’) spatiotemporal vibrational behavior. Previous approaches, such as the Phonovibrogram (PVG), describe...
| Main Authors: | Mona Kirstin Fehling, Maria Schuster, Maximilian Linxweiler, Jörg Lohscheller |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-05-01 |
| Series: | Scientific Reports |
| Subjects: | Laryngovibrogram; Phonovibrogram; Laryngeal high-speed videoendoscopy; Voice disorders |
| Online Access: | https://doi.org/10.1038/s41598-025-00966-8 |
| author | Mona Kirstin Fehling; Maria Schuster; Maximilian Linxweiler; Jörg Lohscheller |
|---|---|
| author_sort | Mona Kirstin Fehling |
| collection | DOAJ |
| description | Laryngeal high-speed video (HSV)-endoscopy allows for fast, non-invasive diagnosis of voice disorders and forms the basis for a comprehensive quantitative analysis of the vocal folds’ (VFs’) spatiotemporal vibrational behavior. Previous approaches, such as the Phonovibrogram (PVG), describe the vibrational behavior of the VFs based exclusively on the time-varying glottal opening. However, focusing solely on the glottal area overlooks the full extent and dynamic behavior of the VF tissue, factors that are crucial for the voice production process. This complicates clinical interpretation and, thus, the comparability of vibrational dynamics in both cross-sectional and longitudinal interventional studies. To address these limitations, this work aims to extend the PVG to provide a more comprehensive representation of the vibrational behavior across the entire VF tissue. Here, we present the Laryngovibrogram (LVG), which is obtained by segmenting not only the glottal area but also the VF tissue, providing a compact quantitative representation of the VFs’ vibrational behavior. The potential of the proposed LVG representation was investigated on 73 HSV recordings from healthy (38 HSVs) and pathological subjects (35 HSVs) in stationary as well as non-stationary phonations. It is demonstrated that the LVG reliably maps the vibrational behavior along the entire length of the VF tissue for both physiological and pathological phonations. Compared to PVG-based measures, LVG-based measures exhibited greater stability in healthy subjects, allowing for a narrower normative range, and showed stronger effect sizes in differentiating clinical groups, suggesting a more robust assessment of vibratory impairments. By scaling the vibration amplitude relative to the length of the segmented VF tissue, the VF vibrations are normalized, enabling meaningful quantitative intra- and inter-individual comparisons. Additionally, calculating the angle enclosed by the two VFs makes it possible to analyze transient effects that occur during non-stationary phonation maneuvers, such as voice onset. By integrating information about the VF tissue, the LVG introduced here represents a paradigm shift in the analysis of laryngeal dynamics from focusing solely on the glottal area to a holistic analysis of the entire VF kinematics, which might improve pathology detection accuracy, reduce subjective assessment errors, and optimize treatment follow-ups, ultimately enhancing both clinical diagnostics and therapeutic outcomes. |
| format | Article |
| id | doaj-art-65ceb98cfb1b48048938fdd485a67e65 |
| institution | OA Journals |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-65ceb98cfb1b48048938fdd485a67e65; 2025-08-20T01:51:38Z; eng; Nature Portfolio; Scientific Reports; 2045-2322; 2025-05-01; 151125; 10.1038/s41598-025-00966-8; The Laryngovibrogram as a normalized spatiotemporal representation of vocal fold dynamics; Mona Kirstin Fehling (Department of Computer Science, Trier University of Applied Sciences); Maria Schuster (Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilian University of Munich); Maximilian Linxweiler (Department of Otorhinolaryngology, Head and Neck Surgery, Saarland University Medical Center / Saarland University Faculty of Medicine); Jörg Lohscheller (Department of Computer Science, Trier University of Applied Sciences); https://doi.org/10.1038/s41598-025-00966-8; Laryngovibrogram; Phonovibrogram; Laryngeal high-speed videoendoscopy; Voice disorders |
| title | The Laryngovibrogram as a normalized spatiotemporal representation of vocal fold dynamics |
| topic | Laryngovibrogram; Phonovibrogram; Laryngeal high-speed videoendoscopy; Voice disorders |
| url | https://doi.org/10.1038/s41598-025-00966-8 |