DocumentCode
1688480
Title
Filter-bank optimization for Frequency Domain Linear Prediction
Author
Peddinti, Vijayaditya ; Hermansky, Hynek
Author_Institution
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
fYear
2013
Firstpage
7102
Lastpage
7106
Abstract
The sub-band Frequency Domain Linear Prediction (FDLP) technique estimates autoregressive models of Hilbert envelopes of subband signals, from segments of discrete cosine transform (DCT) of a speech signal, using windows. Shapes of the windows and their positions on the cosine transform of the signal determine implied filtering of the signal. Thus, the choices of shape, position and number of these windows can be critical for the performance of the FDLP technique. So far, we have used Gaussian or rectangular windows. In this paper asymmetric cochlear-like filters are being studied. Further, a frequency differentiation operation, that introduces an additional set of parameters describing local spectral slope in each frequency sub-band, is introduced to increase the robustness of sub-band envelopes in noise. The performance gains achieved by these changes are reported in a variety of additive noise conditions, with an average relative improvement of 8.04% in phoneme recognition accuracy.
Keywords
Gaussian processes; Hilbert transforms; autoregressive processes; channel bank filters; discrete cosine transforms; frequency-domain analysis; prediction theory; speech recognition; DCT; FDLP technique; Gaussian windows; Hilbert envelopes; additive noise conditions; asymmetric cochlear-like filters; autoregressive models; discrete cosine transform; filter-bank optimization; frequency differentiation operation; local spectral slope; phoneme recognition accuracy; rectangular windows; signal filtering; speech signal; sub-band frequency domain linear prediction; subband envelopes; subband signals; windows shape; Accuracy; Discrete cosine transforms; Noise; Robustness; Speech; Speech processing; Speech recognition; cochlear filters; robust speech recognition; spectral differentiation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639040
Filename
6639040
Link To Document