DocumentCode :
937175
Title :
Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding
Author :
Algazi, V. Ralph ; Brown, Kathy L. ; Ready, Michael J. ; Irvine, David H. ; Cadwell, Christie L. ; Chung, Sang
Author_Institution :
Center for Image Process. & Integrated Comput., California Univ., Davis, CA, USA
Volume :
1
Issue :
3
fYear :
1993
fDate :
7/1/1993
Firstpage :
277
Lastpage :
286
Abstract :
For Part I see ibid., vol. 1, no. 2, pp. 180-195 (1993). In Part I of this paper, the authors introduced an approach to the representation of the speech spectral envelope which makes use of the Karhunen-Loeve (KL) transformation of acoustic subword segments. This signal-dependent representation captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope. Here the authors apply this representation to the analysis, synthesis, and coding of speech. They propose simple quantization and coding strategies for the KL representation vectors as well as for the resulting transform coefficients. The resulting technique is a variable-rate encoding scheme which achieves good speech quality at an average rate of 3.5 kb/s.
Keywords :
speech analysis and processing; speech coding; speech synthesis; transforms; 3.5 kbit/s; KL vectors; Karhunen-Loeve transformation; acoustic speech segments; acoustic subword segments; quantization; speech analysis; speech coding; speech quality; speech spectral envelope; speech synthesis; transform coefficients; transform representation; variable rate encoding scheme; Acoustic applications; Filters; Image coding; Quantization; Signal synthesis; Speech analysis; Speech coding; Speech processing; Speech synthesis; Vocoders;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.232611
Filename :
232611