• DocumentCode
    1589801
  • Title

    Large Vocabulary Mandarin Continuous Speech Recognition under Noisy Environment

  • Author

    Zhao, Qingwei ; Yan, Yonghong ; Pan, Jielin ; Fu, Qiang ; Zhang, Jianping ; Lv, Ping ; Pan, Fuping

  • Author_Institution
    Chinese Acad. of Sci., Beijing
  • Volume
    2
  • fYear
    2007
  • Firstpage
    660
  • Lastpage
    664
  • Abstract
    Speech recognition in noisy environments and on natural spoken speech remains challenging. This paper studies the problem for Mandarin speech from the aspects of signal processing, acoustic modeling, language modeling, decoding, and post-processing. The two-phase mel-warped Wiener filter algorithm is improved to obtain noise-robust features. A segmentation algorithm and a gender determination algorithm are proposed, both based on a phone decoder. MMIE training and noise-adding training are exploited. A one-pass search algorithm based on a cross-word acoustic model is adopted, and a compact search space is constructed that accurately describes the acoustic context. A new technique is proposed for fusing multiple sub-systems that use different acoustic features. Experiments on three test sets of Mandarin continuous speech show that the new system achieves a relative error reduction of 39.28% over the baseline system. The performance of the individual algorithm modules is also analyzed in detail.
  • Keywords
    Wiener filters; natural language processing; speech recognition; gender determination algorithm; large vocabulary Mandarin continuous speech recognition; natural spoken speech; noise-adding training technology; noise-robust feature; noisy environment; segmentation algorithm; two-phase mel-warped Wiener filter algorithm; Acoustic noise; Acoustic signal processing; Decoding; Signal processing algorithms; Space technology; Speech enhancement; Speech processing; Speech recognition; Vocabulary; Working environment noise; Acoustic Model; Algorithm; Confusion Network; Decoding; Fusion; LVCSR; Language Model
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Computation, 2007. ICNC 2007. Third International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2875-5
  • Type

    conf

  • DOI
    10.1109/ICNC.2007.459
  • Filename
    4344433