Title :
A mixed excitation LPC vocoder model for low bit rate speech coding
Author :
McCree, Alan V. ; Barnwell, Thomas P., III
Author_Institution :
Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fDate :
7/1/1995 12:00:00 AM
Abstract :
Traditional pitch-excited linear predictive coding (LPC) vocoders use a fully parametric model to efficiently encode the important information in human speech. These vocoders can produce intelligible speech at low data rates (800-2400 b/s), but they often sound synthetic and generate annoying artifacts such as buzzes, thumps, and tonal noises. These problems increase dramatically if acoustic background noise is present at the speech input. This paper presents a new mixed excitation LPC vocoder model that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech. The new model also eliminates the traditional requirement for a binary voicing decision so that the vocoder performs well even in the presence of acoustic background noise. A 2400-b/s LPC vocoder based on this model has been developed and implemented in simulations and in a real-time system. Formal subjective testing of this coder confirms that it produces natural sounding speech even in a difficult noise environment. In fact, diagnostic acceptability measure (DAM) test scores show that the performance of the 2400-b/s mixed excitation LPC vocoder is close to that of the government standard 4800-b/s CELP coder
Keywords :
acoustic noise; linear predictive coding; speech codecs; speech coding; speech synthesis; vocoders; acoustic background noise; buzzes; diagnostic acceptability measure test scores; fully parametric model; human speech encoding; low bit rate speech coding; mixed excitation LPC vocoder model; performance; pitch-excited LPC vocoders; real-time system; subjective testing; thumps; tonal noises; Acoustic noise; Acoustic testing; Background noise; Bit rate; Humans; Linear predictive coding; Parametric statistics; Speech coding; Speech enhancement; Vocoders;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on