• DocumentCode
    3245048
  • Title

    Improved robust features for speech recognition by integrating time-frequency principal components (TFPC) and histogram equalization (HEQ)

  • Author

    Tsai, Shang-nien ; Lee, Lin-shun

  • Author_Institution
    Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    297
  • Lastpage
    302
  • Abstract
    Robustness for speech recognition technologies with respect to adverse environments has been a key issue for real applications. Time-frequency principal components (TFPC) features have been shown to be a set of powerful data-driven features under matched circumstances, while histogram equalization (HEQ) has been proposed as an efficient feature transformation approach to reduce the mismatch between training and testing conditions. It is proposed that TFPC features can be well integrated with HEQ. HEQ generates a well-matched environment, in which TFPC features can be properly utilized. Extensive experiments with respect to the AURORA2 database verified that improved performance in adverse circumstances can be achieved.
  • Keywords
    learning (artificial intelligence); principal component analysis; speaker recognition; PCA; data-driven features; feature transformation; histogram equalization; principal component analysis; speech recognition; time-frequency principal component features; Acoustic applications; Automatic speech recognition; Histograms; Linear discriminant analysis; Mel frequency cepstral coefficient; Principal component analysis; Robustness; Spatial databases; Speech recognition; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318457
  • Filename
    1318457