DocumentCode :
2291642
Title :
A novel short term prediction method for speech using Haar based time varying models
Author :
Joy, Deepa ; Kumar, R. V Raja ; Pathak, S.S.
Author_Institution :
Dept. of E & ECE, Indian Inst. of Technol., Kharagpur, India
fYear :
2011
fDate :
15-17 Sept. 2011
Firstpage :
142
Lastpage :
145
Abstract :
Speech is a non-stationary signal. Its non-stationarity arises from variations in emotion, speaker, and environment. Almost all speech coding standards available today rely on stationary models for the time-varying parameters of the speech generation model, which affects the perceptual quality of the coded speech. Physically, non-stationarity can be interpreted as a manifestation of the time-varying nature of the speech generation source, the vocal tract. The vocal tract can be roughly modelled as an autoregressive (AR) filter (all-pole model). The time-varying nature of the vocal tract corresponds to time-varying AR parameters. The time-varying AR parameters are expressed as a sum of Haar wavelets. Thus the estimation of the time-varying AR parameters reduces to finding the time-invariant coefficients of the Haar wavelet basis functions. Here we propose a variable bit rate codec that attempts to bring non-stationary modelling to the time-varying AR parameters of speech. Further, long-term prediction (LTP) and the analysis-by-synthesis (AbS) method can be incorporated to develop a codec using this short-term prediction method. The Haar wavelet based speech coding method is seen to outperform the traditional method.
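The reduction described in the abstract can be written out as a short sketch. The notation is an assumption introduced here for illustration and is not taken from the record: x[n] is the speech sample, p the model order, m the number of Haar functions used per frame, and e[n] the prediction residual.

```latex
% Assumed notation (not from the record):
%   a_k[n]    time-varying AR parameters
%   \psi_j[n] Haar basis functions sampled over the analysis frame
%   c_{kj}    time-invariant expansion coefficients
\begin{align}
x[n]   &= \sum_{k=1}^{p} a_k[n]\, x[n-k] + e[n], \\
a_k[n] &= \sum_{j=0}^{m-1} c_{kj}\, \psi_j[n], \\
x[n]   &= \sum_{k=1}^{p} \sum_{j=0}^{m-1} c_{kj}\, \bigl(\psi_j[n]\, x[n-k]\bigr) + e[n].
\end{align}
```

The last line is linear in the constants c_{kj}, so under these assumptions they could be estimated over a frame with a standard linear method such as least squares. The record does not state which estimator the authors use.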
Keywords :
Haar transforms; autoregressive processes; speech coding; wavelet transforms; Haar based time varying models; Haar wavelet basis functions; all pole model; autoregressive filter; long term prediction; nonstationary signal; short term prediction; speech coding standards; speech generation model; stationary model; time varying parameters; variable bit rate codec; Bit rate; Codecs; Estimation; Predictive models; Speech; Speech coding; Speech enhancement;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Communication Technology (ICCCT), 2011 2nd International Conference on
Conference_Location :
Allahabad
Print_ISBN :
978-1-4577-1385-9
Type :
conf
DOI :
10.1109/ICCCT.2011.6075174
Filename :
6075174