• DocumentCode
    1059680
  • Title

    Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition

  • Author

    Lin, Shih-Hsiang ; Chen, Berlin ; Yeh, Yao-Ming

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Normal Univ., Taipei
  • Volume
    17
  • Issue
    1
  • fYear
    2009
  • Firstpage
    84
  • Lastpage
    94
  • Abstract
    The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Several methods have been proposed to improve ASR robustness over the last few decades. The related literature can be generally classified into two categories according to whether the methods are directly based on the feature domain or consider some specific statistical feature characteristics. In this paper, we present a polynomial regression approach that has the merit of directly characterizing the relationship between speech features and their corresponding distribution characteristics to compensate for noise interference. The proposed approach and a variant were thoroughly investigated and compared with a few existing noise robustness approaches. All experiments were conducted using the Aurora-2 database and task. The results show that our approaches achieve considerable word error rate reductions over the baseline system and are comparable to most of the conventional robustness approaches discussed in this paper.
  • Keywords
    polynomials; regression analysis; speech recognition; Aurora-2 database; automatic speech recognition; noise interference; polynomial regression approach; robust speech recognition; speech features; statistical feature characteristics; Automatic speech recognition; Interference; Linear discriminant analysis; Noise level; Noise robustness; Polynomials; Spatial databases; Speech enhancement; Speech recognition; Vectors; Clustering; histogram equalization; polynomial regression; robustness; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.2007612
  • Filename
    4740142