• DocumentCode
    705132
  • Title

    Comparison of noise robust methods in large vocabulary speech recognition

  • Author

    Keronen, Sami ; Remes, Ulpu ; Palomaki, Kalle J. ; Virtanen, Tuomas ; Kurimo, Mikko

  • Author_Institution
    Adaptive Inf. Res. Centre, Aalto Univ., Aalto, Finland
  • fYear
    2010
  • fDate
    23-27 Aug. 2010
  • Firstpage
    1973
  • Lastpage
    1977
  • Abstract
    In this paper, a comparison of three fundamentally different noise robust approaches is carried out. The recognition performances of multicondition training, Data-driven Parallel Model Combination (DPMC), and cluster-based missing data reconstruction methods implemented in a large vocabulary continuous speech recognition system are evaluated with Finnish language speech data consisting of real recordings in noisy environments. All three methods improve the recognition accuracy substantially in poor signal-to-noise ratio (SNR) conditions when compared to a baseline system trained on clean speech. DPMC and missing data reconstruction systems give the best performance on high SNR conditions. On low SNR conditions, the performance of multicondition trained system is ranked the best, DPMC the second best and missing data reconstruction the third.
  • Keywords
    signal denoising; signal reconstruction; speech recognition; cluster-based missing data reconstruction methods; data-driven parallel model combination; large vocabulary speech recognition; multicondition training; noise robust methods; noisy environments; recognition accuracy; Hidden Markov models; Noise measurement; Signal to noise ratio; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2010 18th European
  • Conference_Location
    Aalborg
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7096405