DocumentCode
302108
Title
Harmonic-stochastic excitation (HSX) speech coding below 4 kbit/s
Author
Laflamme, C. ; Salami, R. ; Matmti, R. ; Adoul, J.-P.
Author_Institution
Dept. of Electr. Eng., Sherbrooke Univ., Que., Canada
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
204
Abstract
This paper presents an algorithm for encoding speech signals at bit rates below 4 kbit/s based on mixed harmonic and stochastic modeling of the excitation signal. The algorithm uses robust pitch tracking and efficient voicing analysis to determine the ratios of the harmonic and stochastic components. The harmonic component is synthesized using a bank of bandpass filters, while the stochastic component is synthesized using an inverse STFT with overlap-and-add. Postfiltering is applied at the decoder to enhance the quality of the synthesized speech. A 2.4 kbit/s version of the algorithm was formally tested, and the DAM and DRT scores showed that the coder's performance is comparable to that of the DoD 4.8 kbit/s Federal Standard FS-1016.
Keywords
Fourier transforms; band-pass filters; harmonic analysis; linear predictive coding; speech coding; speech processing; speech synthesis; stochastic processes; 2.4 kbit/s; 4 kbit/s; DAM score; DRT score; HSX algorithm; bandpass filters; efficient voicing analysis; harmonic-stochastic excitation; inverse STFT; low bit rate; overlap-and-add; postfiltering; robust pitch tracking; synthesized speech quality; Algorithm design and analysis; Bit rate; Encoding; Power harmonic filters; Robustness; Signal synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
1996 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Conference Proceedings
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.540326
Filename
540326