DocumentCode
3159255
Title
Low bit-rate speech coder based on a long-term model
Author
Lev, O. ; Malah, David
Author_Institution
Dept. of Electr. Eng., Technion-Israel Inst. of Technol., Haifa, Israel
fYear
2002
fDate
1 Dec. 2002
Firstpage
16
Lastpage
18
Abstract
We present a low bit rate speech coder based on a long-term model (LTM) for voiced speech, and on the WI coder. In the LTM, a periodic input signal undergoes a time-varying spectral shaping representing the evolution of the pitch-cycle waveform. The resulting signal, which has a fixed pitch period but a time-varying pitch-cycle waveform, is multiplied by a time-varying gain function that represents the variation in speech loudness. The resulting signal then undergoes a time-axis warping, which represents the evolution of the pitch period, yielding the output speech signal. The spectral shaping in the proposed coder is based on WI. In WI, speech (or LPC residual) is observed as a continuously evolving sequence of pitch cycle waveforms. A subset of these waveforms is extracted and coded. In the decoder, after inverse quantization, missing waveforms are synthesized by interpolation. The extracted waveforms are normalized to a fixed length and sequentially aligned using a cyclical shift. Then, a two-dimensional surface, called prototype waveform surface or characteristic waveform (CW) is produced from these waveforms.
Keywords
interpolation; linear predictive coding; loudness; spectral analysis; speech coding; time-varying systems; LPC residual; WI coder; characteristic waveform; continuously evolving sequence; cyclical shift; fixed pitch period; interpolation; inverse quantization; long-term model; low bit rate speech coder; pitch-cycle waveform evolution; prototype waveform surface; speech loudness variation; time-axis warping; time-varying gain function; time-varying spectral shaping; two-dimensional surface; voiced speech; waveform synthesis; Bit rate; Decoding; Energy states; Gaussian noise; Interpolation; Linear predictive coding; Signal analysis; Speech analysis; Speech synthesis; Surface waves;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Electronics Engineers in Israel, 2002. The 22nd Convention of
Print_ISBN
0-7803-7693-5
Type
conf
DOI
10.1109/EEEI.2002.1178295
Filename
1178295
Link To Document