Title :
1.2 kbit/s harmonic coder using auditory filters
Author_Institution :
Chiba Inst. of Technol., Narashino, Japan
Abstract :
In this paper, a very low bit speech coder at 1.2 kbps is newly proposed. Like the LPC vocoder, it only requires gain, pitch, and spectral information, but its quality is far superior. The synthesis method is one of harmonic coding, using sinusoids whose frequencies are multiples of the fundamental frequency, where the amplitudes of the sinusoids are adaptively modulated using gammatone filters as a perceptual weighting filter. The sinusoids´ phases are also adjusted so as to maximize the perceptual quality. In order to reduce the total bit rate to 1.2 kbit/s, a new segment coder for spectral information (LSP coefficients) using DP matching is also proposed. The quality of the synthesized speech was improved by 0.45 in the mean opinion score (MOS) compared with that of the simple LPC vocoder operating at the same rate, and it was comparable to that of 2.4 kbit/s MELP coder
Keywords :
filtering theory; harmonic analysis; hearing; linear predictive coding; quantisation (signal); speech synthesis; vocoders; 1.2 kbit/s; 2.4 kbit/s; DP matching; LPC vocoder; LSP coefficients; MELP coder; adaptive modulation; amplitudes; auditory filters; bit rate reduction; fundamental frequency; gain; gammatone filters; harmonic coder; harmonic coding; mean opinion score; perceptual quality; perceptual weighting filter; phases; pitch; quantization; segment coder; sinusoids; spectral information; synthesis method; synthesized speech quality; very low bit speech coder; Adaptive filters; Amplitude modulation; Bit rate; Frequency synthesizers; Linear predictive coding; Phase modulation; Power harmonic filters; Signal synthesis; Speech synthesis; Vocoders;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758164