DocumentCode :
1939373
Title :
Introducing compact: An oscillator-based approach to toll-quality speech coding at low bit rates
Author :
Yen, Anton Y. ; Gorodnitsky, Irina
Author_Institution :
SPAWAR Syst. Center Pacific, San Diego, CA, USA
fYear :
2010
fDate :
Oct. 31 2010-Nov. 3 2010
Firstpage :
293
Lastpage :
297
Abstract :
In this paper, we introduce an improved oscillator model we term the Complete Oscillator Model (COM). A significant advantage of the COM over classical oscillators such as the Self Excited Vocoder is that it is not restricted to modeling only certain larger-scale patterns in the source sequence. Here, we develop a speech coding system based on the proposed COM. In this system, the COM is used in combination with a linear predictor, the Pulsed Autoregressive CompensaTor (PACT), to develop a novel, oscillator-based approach to toll-quality speech coding at low bit rates. Unlike the linear prediction-based models utilized in modern speech coders, oscillators do not depend on an estimate of the residual error to regenerate the signal. As such, the residual is encoded only for select frames, providing a potential improvement in coding efficiency. An implementation of the hybrid COM/PACT system, which we call COMPACT, is described and is shown to provide both perceptual quality and bit rate that are competitive with mature standards such as G.729 and AMR. The given implementation is demonstrated to produce toll-quality speech, as measured by PESQ-MOS, at 9.77 kbps. Future tuning of this implementation is expected to improve performance to where it could exceed the current state of the art.
Keywords :
oscillators; speech coding; vocoders; AMR; G.729; bit rate 9.77 kbit/s; complete oscillator model; hybrid COM/PACT system; linear predictor; low bit rates; pulsed autoregressive compensator; residual error; self excited vocoder; toll-quality speech coding; Bit rate; Delay; Mathematical model; Oscillators; Signal to noise ratio; Speech; Speech coding; Audio oscillators; speech codecs; speech coding; speech processing; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
Conference_Location :
San Jose, CA
ISSN :
2155-7578
Print_ISBN :
978-1-4244-8178-1
Type :
conf
DOI :
10.1109/MILCOM.2010.5680310
Filename :
5680310
Link To Document :
بازگشت