DocumentCode
3521476
Title
Phase coherence in speech reconstruction for enhancement and coding applications
Author
Quatieri, Thomas F. ; McAulay, Robert J.
Author_Institution
MIT Lincoln Lab., Lexington, MA, USA
fYear
1989
fDate
23-26 May 1989
Firstpage
207
Abstract
It has been shown that an analysis-synthesis system based on a sinusoidal representation leads to synthetic speech that is essentially perceptually indistinguishable from the original. A change in speech quality has been observed, however, when the phase relation of the sine waves is altered. This occurs in practice when sine waves are processed for speech enhancement and for speech coding. A description is given of a zero-phase sinusoidal analysis-synthesis system which generates natural-sounding speech without the requirement of vocal tract phase. The method provides a basis for improving sound quality by providing different levels of phase coherence in speech reconstruction for time-scale modification, for a baseline system for coding, and for reducing the peak-to-RMS ratio by dispersion
Keywords
encoding; speech analysis and processing; speech synthesis; baseline system; dispersion; natural-sounding speech; original; peak-to-RMS ratio; perceptually indistinguishable; phase coherence; phase relation; sine waves; sinusoidal representation; sound quality; speech coding; speech enhancement; speech quality; speech reconstruction; synthetic speech; time-scale modification; zero-phase sinusoidal analysis-synthesis system; Coherence; Frequency; Laboratories; Pulse shaping methods; Shape control; Speech analysis; Speech coding; Speech enhancement; Speech synthesis; Time varying systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location
Glasgow
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1989.266401
Filename
266401
Link To Document