Vowel recognition based on acoustic and visual features

The aim of the research work presented is to show a system that may facilitate speech training for hearing impaired people. The system engineered combines both acoustic and visual vowel data acquisition and analysis modules. The acoustic feature extraction involves mel-cepstral analysis. The Active...

Full description

Saved in:
Bibliographic Details
Main Authors: P. DALKA, B. KOSTEK, A. CZYŻEWSKI
Format: Article
Language:English
Published: Institute of Fundamental Technological Research Polish Academy of Sciences 2014-04-01
Series:Archives of Acoustics
Subjects:
Online Access:https://acoustics.ippt.pan.pl/index.php/aa/article/view/673
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The aim of the research work presented is to show a system that may facilitate speech training for hearing impaired people. The system engineered combines both acoustic and visual vowel data acquisition and analysis modules. The acoustic feature extraction involves mel-cepstral analysis. The Active Shape Model method is used for extracting visual speech features from the shape and movement of the lips. Artificial Neural Networks (ANNs) are utilized as the classifier, feature vectors extracted combine both modalities of the human speech. The system is validated with the recordings of speakers that were not used for the lip model creating and for the ANN training. Additional experiments with the degraded acoustic information are carried out in order to test the system robustness against various distortions affecting speech utterances.
ISSN:0137-5075
2300-262X