• DocumentCode
    1632921
  • Title

    Key technologies of pre-processing and post-processing methods for embedded automatic speech recognition systems

  • Author

    He, Dongzhi ; Hou, Yibin ; Li, Yuanyuan ; Ding, Zhi-Hao

  • Author_Institution
    Inst. of Embedded Software & Syst., Beijing Univ. of Technol., Beijing, China
  • fYear
    2010
  • Firstpage
    76
  • Lastpage
    80
  • Abstract
    Signal pre-processing and post-processing are becoming two key factors that impact embedded speech recognition systems from the laboratory to practical application. Speech endpoint detection and out-of-vocabulary rejection are the most important part of the speech pre-processing and post-processing respectively. The performance of traditional speech endpoint detection based on short-term energy and zero-crossing rate degrade dramatically in noisy environments. Methods based on frequency-domain need complex computing, and they can not meet embedded systems well. In this paper, we present a new endpoint detection algorithm that is based on statistical theory for isolated-word. The correct endpoint detection rate reaches 97.40% using the method. In this paper one-class support vector machine theory is introduced to solve out-of-vocabulary rejection. Using this algorithm system, true recognition fraction(TRF) is up to 96%, and false recognition fraction(FRF ) is about 95%.
  • Keywords
    signal processing; speech processing; speech recognition; embedded automatic speech recognition systems; false recognition fraction; out-of-vocabulary rejection; signal post-processing; signal pre-processing; speech endpoint detection; true recognition fraction; Frequency domain analysis; Laboratories; Noise; Speech; Speech recognition; endpoint detection; out-of-vocabulary rejection; speech recognition; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mechatronics and Embedded Systems and Applications (MESA), 2010 IEEE/ASME International Conference on
  • Conference_Location
    Qingdao, ShanDong
  • Print_ISBN
    978-1-4244-7101-0
  • Type

    conf

  • DOI
    10.1109/MESA.2010.5552096
  • Filename
    5552096