• DocumentCode
    33330
  • Title

    Bounded Conditional Mean Imputation with Observation Uncertainties and Acoustic Model Adaptation

  • Author

    Remes, Ulpu ; Ramirez Lopez, Ana ; Palomaki, Kalle ; Kurimo, Mikko

  • Author_Institution
    Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
  • Volume
    23
  • Issue
    7
  • fYear
    2015
  • fDate
    Jul-15
  • Firstpage
    1198
  • Lastpage
    1208
  • Abstract
    Automatic speech recognition systems use noise compensation and acoustic model adaptation to increase robustness towards speaker and environmental variation. The current work focuses on noise compensation with bounded conditional mean imputation (BCMI). BCMI approaches are missing-data methods which operate on the assumption that noise-corrupted observations can be divided into reliable and unreliable components. BCMI methods substitute the unreliable components with a clean speech posterior distribution. The posterior means can be used as clean speech estimates and the posterior variances can be introduced in acoustic model likelihood calculation as observation uncertainties. In addition, we propose in the current work that similar uncertainties are introduced in acoustic model adaptation. Evaluation with speech data recorded in diverse public and car environments indicates that the proposed uncertainties improve adaptation performance. When uncertainties were used in acoustic model likelihood calculation and adaptation, the proposed imputation and adaptation system introduced 15%-84% relative error reductions to an uncompensated baseline system performance.
  • Keywords
    acoustic noise; speech recognition; BCMI; acoustic model adaptation; acoustic model likelihood calculation; automatic speech recognition systems; bounded conditional mean imputation; missing-data methods; noise compensation; noise-corrupted observations; observation uncertainties; Acoustics; Adaptation models; Gaussian distribution; Mathematical model; Reliability; Speech; Uncertainty; Acoustic model adaptation; missing data; noise-robust speech recognition; observation uncertainties;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2424322
  • Filename
    7089214