مرکز منطقه ای اطلاع رساني علوم و فناوري - Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation

DocumentCode :

2551935

Title :

Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation

Author :

Bhaskar, U. ; Nandkumar, S. ; Swaminathan, K.

Author_Institution :

Hughes Network Syst. Inc., Germantown, MD, USA

fYear :

2000

fDate :

2000

Firstpage :

Lastpage :

Abstract :

The 4.0 kbit/s speech codec described is based on a frequency domain interpolative (FDI) coding technique, which belongs to the class of prototype waveform interpolation (PWI) coding techniques. The codec also has an integrated voice activity detector (VAD) and a noise reduction capability. The input signal is subjected to LPC analysis and the prediction residual is separated into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) component. The SEW magnitude component is quantized using a hierarchical predictive vector quantization approach. The REW magnitude is quantized using a gain and a sub-band based shape. The SEW and REW phases are derived at the decoder using a phase model, based on a transmitted measure of voice periodicity. The spectral (LSP) parameters are quantized using a combination of scalar and vector quantizers. The 4.0 kbits/s coder has an algorithmic delay of 60 ms and an estimated floating point complexity of 21.5 MIPS. The performance of this coder has been evaluated using in-house MOS tests under various conditions such as background noise, channel errors, self-tandem, and DTX mode of operation, and has been shown to be statistically equivalent to ITU-T G.729 8 kbps codec across all conditions tested

Keywords :

frequency-domain synthesis; interpolation; linear predictive coding; spectral analysis; speech codecs; telecommunication equipment testing; vector quantisation; vocoders; 21.5 MIPS; 4 kbit/s; 60 ms; 8 kbit/s; DTX mode; ITU-T G.729 codec; LPC analysis; MOS tests; REW magnitude; SEW magnitude component; algorithmic delay; background noise; channel errors; decoder; floating point complexity; frequency domain interpolative coding; frequency-domain interpolation; gain; hierarchical predictive vector quantization; input signal; noise reduction; performance; phase model; prediction residual; prototype waveform interpolation coding; rapidly evolving waveform; scalar quantizer; self-tandem; slowly evolving waveform; spectral parameters; speech codec; speech coder design; speech coder performance; sub-band based shape; voice activity detector; voice periodicity measure; Automatic testing; Delay estimation; Detectors; Fault detection; Frequency domain analysis; Interpolation; Linear predictive coding; Noise reduction; Prototypes; Speech codecs;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Speech Coding, 2000. Proceedings. 2000 IEEE Workshop on

Conference_Location :

Delavan, WI

Print_ISBN :

0-7803-6416-3

Type :

conf

DOI :

10.1109/SCFT.2000.878376

Filename :

878376

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2551935