Title :
Filter-bank optimization for Frequency Domain Linear Prediction
Author :
Peddinti, Vijayaditya ; Hermansky, Hynek
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
The sub-band Frequency Domain Linear Prediction (FDLP) technique estimates autoregressive models of Hilbert envelopes of subband signals, from segments of discrete cosine transform (DCT) of a speech signal, using windows. Shapes of the windows and their positions on the cosine transform of the signal determine implied filtering of the signal. Thus, the choices of shape, position and number of these windows can be critical for the performance of the FDLP technique. So far, we have used Gaussian or rectangular windows. In this paper asymmetric cochlear-like filters are being studied. Further, a frequency differentiation operation, that introduces an additional set of parameters describing local spectral slope in each frequency sub-band, is introduced to increase the robustness of sub-band envelopes in noise. The performance gains achieved by these changes are reported in a variety of additive noise conditions, with an average relative improvement of 8.04% in phoneme recognition accuracy.
Keywords :
Gaussian processes; Hilbert transforms; autoregressive processes; channel bank filters; discrete cosine transforms; frequency-domain analysis; prediction theory; speech recognition; DCT; FDLP technique; Gaussian windows; Hilbert envelopes; additive noise conditions; asymmetric cochlear-like filters; autoregressive models; discrete cosine transform; filter-bank optimization; frequency differentiation operation; local spectral slope; phoneme recognition accuracy; rectangular windows; signal filtering; speech signal; sub-band frequency domain linear prediction; subband envelopes; subband signals; windows shape; Accuracy; Discrete cosine transforms; Noise; Robustness; Speech; Speech processing; Speech recognition; cochlear filters; robust speech recognition; spectral differentiation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639040