A speech compression method without utilizing signal prediction

Previous speech compression methods for practical purposes had been based on signal prediction, taking the auditory functions into account but overlooking features specific to speech signals. A new method was developed in which amplitude envelopes in four frequency bands corresponding to spectral fa...

Full description

Saved in:
Bibliographic Details
Main Authors: Ikuo Matsuo, Kazuo Ueda, Yoshitaka Nakajima
Format: Article
Language:English
Published: SAGE Publishing 2025-05-01
Series:i-Perception
Online Access:https://doi.org/10.1177/20416695251340236
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Previous speech compression methods for practical purposes had been based on signal prediction, taking the auditory functions into account but overlooking features specific to speech signals. A new method was developed in which amplitude envelopes in four frequency bands corresponding to spectral factors common to different languages were used to modulate infinitely peak-clipped signals, which also had been revealed to contain useful linguistic information. In a pilot experiment, intelligibility reached ~80% with limited information of only 2,400 bits per second (bps), whereas the bit rate of the original signal was 256,000 bps. This algorithm preserves the naturalness of speech and is easy to grasp intuitively.
ISSN:2041-6695