DocumentCode :
807586
Title :
A mixed excitation LPC vocoder model for low bit rate speech coding
Author :
McCree, Alan V. ; Barnwell, Thomas P., III
Author_Institution :
Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
3
Issue :
4
fYear :
1995
fDate :
7/1/1995
Firstpage :
242
Lastpage :
250
Abstract :
Traditional pitch-excited linear predictive coding (LPC) vocoders use a fully parametric model to efficiently encode the important information in human speech. These vocoders can produce intelligible speech at low data rates (800-2400 b/s), but they often sound synthetic and generate annoying artifacts such as buzzes, thumps, and tonal noises. These problems increase dramatically if acoustic background noise is present at the speech input. This paper presents a new mixed excitation LPC vocoder model that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech. The new model also eliminates the traditional requirement for a binary voicing decision so that the vocoder performs well even in the presence of acoustic background noise. A 2400-b/s LPC vocoder based on this model has been developed and implemented in simulations and in a real-time system. Formal subjective testing of this coder confirms that it produces natural-sounding speech even in a difficult noise environment. In fact, diagnostic acceptability measure (DAM) test scores show that the performance of the 2400-b/s mixed excitation LPC vocoder is close to that of the government standard 4800-b/s CELP coder.
Keywords :
acoustic noise; linear predictive coding; speech codecs; speech coding; speech synthesis; vocoders; acoustic background noise; buzzes; diagnostic acceptability measure test scores; fully parametric model; human speech encoding; low bit rate speech coding; mixed excitation LPC vocoder model; performance; pitch-excited LPC vocoders; real-time system; subjective testing; thumps; tonal noises; Acoustic noise; Acoustic testing; Background noise; Bit rate; Humans; Linear predictive coding; Parametric statistics; Speech coding; Speech enhancement; Vocoders;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.397089
Filename :
397089