• DocumentCode
    653736
  • Title

    Robust spectral representation using group delay function and stabilized weighted linear prediction for additive noise degradations

  • Author

    Gowda, Dhananjaya ; Pohjalainen, Jouni ; Alku, Paavo ; Kurimo, Mikko

  • Author_Institution
    Sch. of Electr. Eng., Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
  • fYear
    2013
  • fDate
    16-19 Oct. 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In this paper, we propose a robust spectral representation using the group delay (GD) function computed from the stabilized weighted linear prediction (SWLP) coefficients. Temporal weighting of the cost function in linear prediction (LP) analysis with the short-term energy of the speech signal improves the robustness of the resultant spectrum. The additive property of the group delay function provides for better representation of weaker resonances in the spectrum, and thereby improving the robustness of the representation. The SWLP provides robustness in the temporal domain, whereas the GD function provides robustness in the frequency domain. The proposed SWLP-GD representation is shown to be robust against different types of additive noise degradations, compared to the popularly used discrete Fourier transform (DFT) or LP based representations. In a small-scale closed-set speaker recognition experiment, the cepstral features derived from the proposed SWLP-GD spectrum perform better than the traditional mel-cepstral features computed from the discrete Fourier transform (DFT) spectrum under conditions of mismatched degradations.
  • Keywords
    cepstral analysis; delays; frequency-domain analysis; prediction theory; signal representation; speaker recognition; speech processing; GD function; SWLP-GD representation; SWLP-GD spectrum; additive noise degradation; additive property; cepstral feature; cost function; frequency domain analysis; group delay function; robust spectral representation; speaker recognition; spectrum robustness improvement; speech signal processing; stabilized weighted linear prediction; temporal domain analysis; temporal weighting; Degradation; Delays; Discrete Fourier transforms; Predictive models; Robustness; Signal to noise ratio; Speech; frequency weighted segmental SNR; group delay function; robust spectrum estimation; stabilized weighted linear prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Technology and Human - Computer Dialogue (SpeD), 2013 7th Conference on
  • Conference_Location
    Cluj-Napoca
  • Type

    conf

  • DOI
    10.1109/SpeD.2013.6682663
  • Filename
    6682663