• DocumentCode
    336769
  • Title

    Speaker normalized spectral subband parameters for noise robust speech recognition

  • Author

    Tsuge, Satoru ; Fukuda, Toshio ; Singer, Harald

  • Author_Institution
    ATR Interpreting Telephony Res. Labs., Kyoto, Japan
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    285
  • Abstract
    This paper proposes speaker normalized spectral subband centroids (SSCs) as supplementary features for speech recognition in noisy environments. SSCs are computed as the frequency centroid of each subband of the speech signal's power spectrum. Since conventional SSCs depend on the formant frequencies of a speaker, we introduce a speaker normalization technique into the SSC computation to reduce speaker variability. Experimental results on spontaneous speech recognition show that, as supplementary features, the speaker normalized SSCs improve recognition performance more than the conventional SSCs.
  • Keywords
    feature extraction; noise; parameter estimation; spectral analysis; speech recognition; experimental results; formant frequencies; frequency centroids; noise robust speech recognition; power spectrum; recognition performance; speaker normalized spectral subband centroids; speaker normalized spectral subband parameters; speaker variability reduction; speech signal; spontaneous speech recognition; Databases; Dynamic range; Feature extraction; Frequency conversion; Noise robustness; Noise shaping; Signal to noise ratio; Speech recognition; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758118
  • Filename
    758118
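
The abstract describes SSCs as frequency centroids computed per subband from the power spectrum. The following is a minimal sketch of that idea for a single frame, not the authors' implementation. The function name `spectral_subband_centroids`, the equal-width rectangular subbands, the Hamming window, and the exponent `gamma` are all assumptions made for illustration; the record does not state the filter shapes or the number of subbands used in the paper. The speaker normalization step is not sketched, because the abstract does not specify it.

```python
import numpy as np


def spectral_subband_centroids(frame, sample_rate, num_subbands=4, gamma=1.0):
    """Return one frequency centroid (in Hz) per subband for a single frame.

    Assumptions (not taken from the paper): equal-width, non-overlapping
    rectangular subbands, a Hamming analysis window, and power raised to
    the exponent `gamma` before weighting.
    """
    frame = np.asarray(frame, dtype=float)

    # Power spectrum of the windowed frame.
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)

    # Subband edges from 0 Hz up to the Nyquist frequency.
    edges = np.linspace(0.0, sample_rate / 2.0, num_subbands + 1)

    centroids = np.empty(num_subbands)
    for m in range(num_subbands):
        low, high = edges[m], edges[m + 1]
        if m == num_subbands - 1:
            mask = (freqs >= low) & (freqs <= high)  # include the Nyquist bin
        else:
            mask = (freqs >= low) & (freqs < high)

        weights = power[mask] ** gamma
        total = weights.sum()
        if total > 0:
            # Power-weighted mean frequency within the subband.
            centroids[m] = (freqs[mask] * weights).sum() / total
        else:
            # No energy in this band: fall back to the band midpoint.
            centroids[m] = 0.5 * (low + high)
    return centroids


if __name__ == "__main__":
    sample_rate = 8000
    t = np.arange(400) / sample_rate
    tone = np.sin(2 * np.pi * 1500.0 * t)  # energy concentrated near 1500 Hz
    print(spectral_subband_centroids(tone, sample_rate))
```

For a pure tone, the centroid of the subband containing the tone sits close to the tone frequency, which illustrates why centroids track spectral peaks such as formants.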