• DocumentCode
    1688480
  • Title

    Filter-bank optimization for Frequency Domain Linear Prediction

  • Author

    Peddinti, Vijayaditya ; Hermansky, Hynek

  • Author_Institution
    Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2013
  • Firstpage
    7102
  • Lastpage
    7106
  • Abstract
    The sub-band Frequency Domain Linear Prediction (FDLP) technique estimates autoregressive models of Hilbert envelopes of subband signals, from segments of discrete cosine transform (DCT) of a speech signal, using windows. Shapes of the windows and their positions on the cosine transform of the signal determine implied filtering of the signal. Thus, the choices of shape, position and number of these windows can be critical for the performance of the FDLP technique. So far, we have used Gaussian or rectangular windows. In this paper asymmetric cochlear-like filters are being studied. Further, a frequency differentiation operation, that introduces an additional set of parameters describing local spectral slope in each frequency sub-band, is introduced to increase the robustness of sub-band envelopes in noise. The performance gains achieved by these changes are reported in a variety of additive noise conditions, with an average relative improvement of 8.04% in phoneme recognition accuracy.
  • Keywords
    Gaussian processes; Hilbert transforms; autoregressive processes; channel bank filters; discrete cosine transforms; frequency-domain analysis; prediction theory; speech recognition; DCT; FDLP technique; Gaussian windows; Hilbert envelopes; additive noise conditions; asymmetric cochlear-like filters; autoregressive models; discrete cosine transform; filter-bank optimization; frequency differentiation operation; local spectral slope; phoneme recognition accuracy; rectangular windows; signal filtering; speech signal; sub-band frequency domain linear prediction; subband envelopes; subband signals; windows shape; Accuracy; Discrete cosine transforms; Noise; Robustness; Speech; Speech processing; Speech recognition; cochlear filters; robust speech recognition; spectral differentiation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639040
  • Filename
    6639040