Improving Speech Recognition Rate through Analysis Parameters

Speech signal is redundant and non-stationary by nature. Because of vocal tract inertness these variations are not very rapid and the signal can be considered as stationary in short segments. It is presumed that in short-time magnitude spectrum the most distinct information of speech is contained. T...

Full description

Saved in:
Bibliographic Details
Main Authors: Eringis Deividas, Tamulevičius Gintautas
Format: Article
Language:English
Published: Riga Technical University Press 2014-05-01
Series:Electrical, Control and Communication Engineering
Subjects:
Online Access:https://doi.org/10.2478/ecce-2014-0009
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Speech signal is redundant and non-stationary by nature. Because of vocal tract inertness these variations are not very rapid and the signal can be considered as stationary in short segments. It is presumed that in short-time magnitude spectrum the most distinct information of speech is contained. This is the main reason for speech signal analysis in frame-by-frame manner. The analyzed speech signal is segmented into overlapping segments (so-called frames) for this purpose. Segments of 15-25 ms with the overlap of 10-15 ms are used usually.
ISSN:2255-9159