DocumentCode :
2551935
Title :
Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation
Author :
Bhaskar, U. ; Nandkumar, S. ; Swaminathan, K.
Author_Institution :
Hughes Network Syst. Inc., Germantown, MD, USA
fYear :
2000
fDate :
2000
Firstpage :
8
Lastpage :
10
Abstract :
The 4.0 kbit/s speech codec described is based on a frequency domain interpolative (FDI) coding technique, which belongs to the class of prototype waveform interpolation (PWI) coding techniques. The codec also has an integrated voice activity detector (VAD) and a noise reduction capability. The input signal is subjected to LPC analysis and the prediction residual is separated into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) component. The SEW magnitude component is quantized using a hierarchical predictive vector quantization approach. The REW magnitude is quantized using a gain and a sub-band based shape. The SEW and REW phases are derived at the decoder using a phase model, based on a transmitted measure of voice periodicity. The spectral (LSP) parameters are quantized using a combination of scalar and vector quantizers. The 4.0 kbits/s coder has an algorithmic delay of 60 ms and an estimated floating point complexity of 21.5 MIPS. The performance of this coder has been evaluated using in-house MOS tests under various conditions such as background noise, channel errors, self-tandem, and DTX mode of operation, and has been shown to be statistically equivalent to ITU-T G.729 8 kbps codec across all conditions tested
Keywords :
frequency-domain synthesis; interpolation; linear predictive coding; spectral analysis; speech codecs; telecommunication equipment testing; vector quantisation; vocoders; 21.5 MIPS; 4 kbit/s; 60 ms; 8 kbit/s; DTX mode; ITU-T G.729 codec; LPC analysis; MOS tests; REW magnitude; SEW magnitude component; algorithmic delay; background noise; channel errors; decoder; floating point complexity; frequency domain interpolative coding; frequency-domain interpolation; gain; hierarchical predictive vector quantization; input signal; noise reduction; performance; phase model; prediction residual; prototype waveform interpolation coding; rapidly evolving waveform; scalar quantizer; self-tandem; slowly evolving waveform; spectral parameters; speech codec; speech coder design; speech coder performance; sub-band based shape; voice activity detector; voice periodicity measure; Automatic testing; Delay estimation; Detectors; Fault detection; Frequency domain analysis; Interpolation; Linear predictive coding; Noise reduction; Prototypes; Speech codecs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Coding, 2000. Proceedings. 2000 IEEE Workshop on
Conference_Location :
Delavan, WI
Print_ISBN :
0-7803-6416-3
Type :
conf
DOI :
10.1109/SCFT.2000.878376
Filename :
878376
Link To Document :
بازگشت