DocumentCode :
294621
Title :
Variable dimension spectral coding of speech at 2400 bps and below with phonetic classification
Author :
Das, Amitava ; Gersho, Allen
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
492
Abstract :
The low bit rate enhanced multiband excitation or EMBE speech coder adds several important new features including phonetic classification and a naval spectral quantization technique called variable dimension vector quantization (VDVQ) to the basic multiband excitation vocoder. Phonetic classification allows the adaptation of spectral modeling and quantization to the local acoustic-phonetic character of the speech signal, enhancing quality and robustness. The VDVQ scheme quantizes the log-spectrum with relatively few bits while preserving perceptually important features. Both the fixed rate (2.4 kb/s) and the variable rate (1.44 kb/s average) implementations of EMBE deliver speech quality comparable to the 4.8 kb/s Federal Standard 1016 CELP coder and the 4.15 kb/s Inmarsat-M standard IMBE coder
Keywords :
acoustic signal processing; spectral analysis; speech coding; variable rate codes; vector quantisation; vocoders; 1.44 kbit/s; 2400 bit/s; 4.15 kbit/s; 4.8 kbit/s; EMBE speech coder; Federal Standard 1016 CELP coder; Inmarsat-M standard IMBE coder; fixed rate coding; local acoustic-phonetic character; log-spectrum; low bit rate enhanced multiband excitation; multiband excitation vocoder; perceptually important features; phonetic classification; spectral modeling; spectral quantization; speech coding; speech quality; speech signal; variable dimension spectral coding; variable dimension vector quantization; variable rate coding; Adaptation model; Bit rate; Code standards; Noise shaping; Signal processing; Space technology; Speech coding; Speech enhancement; Speech processing; Speech synthesis; Vector quantization; Vocoders;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479636
Filename :
479636
Link To Document :
بازگشت