DocumentCode
2291642
Title
A novel short term prediction method for speech using Haar based time varying models
Author
Joy, Deepa ; Kumar, R. V Raja ; Pathak, S.S.
Author_Institution
Dept. of E & ECE, Indian Inst. of Technol., Kharagpur, India
fYear
2011
fDate
15-17 Sept. 2011
Firstpage
142
Lastpage
145
Abstract
Speech is a non-stationary signal. Its non-stationarity arises from variations in emotion, speaker, and environment. Almost all speech coding standards available today rely on stationary models for the time-varying parameters of the speech generation model, which degrades the perceptual quality of the coded speech. Physically, non-stationarity can be interpreted as a manifestation of the time-varying nature of the speech generation source, the vocal tract. The vocal tract can be roughly modelled as an autoregressive (AR) filter (an all-pole model), so its time-varying nature corresponds to time-varying AR parameters. These time-varying AR parameters are expressed as a sum of Haar wavelets, which reduces their estimation to finding the time-invariant coefficients of the Haar wavelet basis functions. We propose a variable bit rate codec that incorporates this non-stationary modelling of the time-varying AR parameters of speech. Long-term prediction (LTP) and an analysis-by-synthesis (AbS) method can further be incorporated to develop a codec around this short-term prediction method. The Haar wavelet based speech coding method is seen to outperform the traditional method.
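The reduction described in the abstract, from time-varying AR parameters to time-invariant Haar coefficients, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the predictor convention x(n) ≈ Σ_k a_k(n) x(n−k) with a_k(n) = Σ_j c[k][j] h_j(n), unnormalised Haar functions, a frame length that is a power of two, and plain least squares via the normal equations. The names `haar_basis`, `solve`, and `fit_tvar` are hypothetical helpers introduced only for this example.

```python
def haar_basis(n_samples, n_funcs):
    """Sample the first n_funcs Haar functions on n_samples points.

    Assumes n_samples is a power of two and n_funcs <= n_samples.
    Functions are unnormalised (values in {-1, 0, +1}).
    """
    basis = [[1.0] * n_samples]          # constant (scaling) function
    pieces = 1                           # number of wavelets at the current scale
    while len(basis) < n_funcs:
        width = n_samples // pieces
        for shift in range(pieces):
            if len(basis) == n_funcs:
                break
            row = [0.0] * n_samples
            start, half = shift * width, width // 2
            for n in range(start, start + half):
                row[n] = 1.0
            for n in range(start + half, start + width):
                row[n] = -1.0
            basis.append(row)
        pieces *= 2
    return basis


def solve(a, b):
    """Solve the square system a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]   # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_tvar(x, order, n_funcs):
    """Least-squares estimate of c[k][j] in a_k(n) = sum_j c[k][j] * h_j(n)."""
    n_samples = len(x)
    basis = haar_basis(n_samples, n_funcs)
    dim = order * n_funcs
    gram = [[0.0] * dim for _ in range(dim)]
    rhs = [0.0] * dim
    for n in range(order, n_samples):
        # Regressor: h_j(n) * x(n - k) for every lag k and basis index j.
        phi = [basis[j][n] * x[n - k]
               for k in range(1, order + 1) for j in range(n_funcs)]
        for r in range(dim):
            rhs[r] += phi[r] * x[n]
            for c in range(dim):
                gram[r][c] += phi[r] * phi[c]
    flat = solve(gram, rhs)
    return [flat[k * n_funcs:(k + 1) * n_funcs] for k in range(order)]


# Synthetic check (made-up data, not speech): a first-order predictor whose
# coefficient steps from 0.95 in the first half of the frame to 1.05 in the second.
N = 64
h = haar_basis(N, 2)
true_c = [1.0, -0.05]
x = [1.0]
for n in range(1, N):
    a_n = true_c[0] * h[0][n] + true_c[1] * h[1][n]
    x.append(a_n * x[n - 1])
coeffs = fit_tvar(x, order=1, n_funcs=2)
```

Because the coefficients c[k][j] do not depend on time, the problem is linear in them, and a single least-squares solve per frame recovers a coefficient trajectory that can change within the frame. In the synthetic check, `coeffs[0]` should match `true_c` up to rounding error, since the signal is generated without excitation noise.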
Keywords
Haar transforms; autoregressive processes; speech coding; wavelet transforms; Haar based time varying models; Haar wavelet basis functions; all pole model; autoregressive filter; long term prediction; nonstationary signal; short term prediction; speech coding standards; speech generation model; stationary model; time varying parameters; variable bit rate codec; Bit rate; Codecs; Estimation; Predictive models; Speech; Speech coding; Speech enhancement;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Communication Technology (ICCCT), 2011 2nd International Conference on
Conference_Location
Allahabad
Print_ISBN
978-1-4577-1385-9
Type
conf
DOI
10.1109/ICCCT.2011.6075174
Filename
6075174