Title :
Harmonic-stochastic excitation (HSX) speech coding below 4 kbit/s
Author :
Laflamme, C. ; Salami, R. ; Matmti, R. ; Adoul, J.-P.
Author_Institution :
Dept. of Electr. Eng., Sherbrooke Univ., Que., Canada
Abstract :
This paper presents an algorithm for encoding speech signals at bit rates below 4 kbit/s based on a mixed harmonic and stochastic modeling of the excitation signal. The algorithm uses robust pitch tracking and efficient voicing analysis to determine the ratios of the harmonic and stochastic components. The harmonic component is synthesized using a bank of bandpass filters while the stochastic component is synthesized using inverse STFT with overlap-and-add. Postfiltering is utilized at the decoder to enhance the quality of synthesized speech. A 2.4 kbit/s version of the algorithm was formally tested and the DAM and DRT scores showed that the coder performance is comparable to that of DoD 4.8 kbit/s Federal Standard FS-1016
Keywords :
Fourier transforms; band-pass filters; harmonic analysis; linear predictive coding; speech coding; speech processing; speech synthesis; stochastic processes; 2.4 kbit/s; 4 kbit/s; DAM score; DRT score; HSX algorithm; bandpass filters; efficient voicing analysis; harmonic-stochastic excitation; inverse STFT; low bit rate; overlap-and-add; postfiltering; robust pitch tracking; speech coding; synthesized speech quality; Algorithm design and analysis; Bit rate; Encoding; Harmonic analysis; Power harmonic filters; Robustness; Signal synthesis; Speech coding; Speech synthesis; Stochastic processes;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.540326