Title :
Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding
Author :
Algazi, V. Ralph ; Brown, Kathy L. ; Ready, Michael J. ; Irvine, David H. ; Cadwell, Christie L. ; Chung, Sang
Author_Institution :
Center for Image Process. & Integrated Comput., California Univ., Davis, CA, USA
fDate :
7/1/1993 12:00:00 AM
Abstract :
For Part I see ibid., vol.1, no.2, p.180-95 (1993). In Part I of this paper, the authors introduced an approach to the representation of the speech spectral envelope which makes use of the Karhunen-Loeve (KL) transformation of acoustic subword segments. This signal-dependent representation captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope. Here the authors apply this representation to the analysis, synthesis, and coding of speech. They propose simple quantization and coding strategies for the KL representation vectors as well as for the resulting transform coefficients. The resulting technique is a variable rate encoding scheme which achieves good speech quality at an average rate of 3.5 kb/s
Keywords :
speech analysis and processing; speech coding; speech synthesis; transforms; 3.5 kbit/s; KL vectors; Karhunen-Loeve transformation; acoustic speech segments; acoustic subword segments; quantization; speech analysis; speech coding; speech quality; speech spectral envelope; speech synthesis; transform coefficients; transform representation; variable rate encoding scheme; Acoustic applications; Filters; Image coding; Quantization; Signal synthesis; Speech analysis; Speech coding; Speech processing; Speech synthesis; Vocoders;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on