• DocumentCode
    501226
  • Title

    A Speech Endpoint Detection Based on Dynamically Updated Threshold of Box-Counting Dimension

  • Author

    Hongbin, Gao ; Weiyi, Pang ; Chunru, Huang ; Yongqiang, Zhang

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Hebei Univ. of Sci. & Technol., Shijiazhuang, China
  • Volume
    2
  • fYear
    2009
  • fDate
    15-17 May 2009
  • Firstpage
    397
  • Lastpage
    401
  • Abstract
    Accurate endpoint detection is important for speech processing. The endpoint detection problem is nontrivial for non-stationary backgrounds, where noise may be introduced by the speaker, the recording environment, and the transmission system. In this paper, an effective endpoint detection algorithm is proposed to improve speech signal processing performance in noisy environments. The proposed speech/pause discrimination method is based on the box-counting dimension, and a dynamically updated threshold and an adaptive window are used to improve the performance of the algorithm. The method is characterized by higher accuracy and flexibility, faster processing speed, and lower computational cost. Laboratory experiments on a large number of samples show improvements in detection accuracy over representative endpoint detection algorithms for robust speech processing.
  • Keywords
    noise; speech processing; adaptive window; box-counting dimension; dynamically updated threshold; noisy environment; pause discrimination method; speech discrimination method; speech endpoint detection; speech signal processing performance; Acoustic noise; Adaptive signal detection; Detection algorithms; Fractals; Information technology; Signal processing algorithms; Signal to noise ratio; Speech processing; Speech recognition; Working environment noise; signal processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Technology and Applications, 2009. IFITA '09. International Forum on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-0-7695-3600-2
  • Type

    conf

  • DOI
    10.1109/IFITA.2009.381
  • Filename
    5231346
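
The abstract names two ingredients, a box-counting dimension per frame and a threshold that is updated as the signal is processed, without giving formulas. The sketch below illustrates one plausible reading and is not the authors' implementation. The grid scales, the smoothing factor `alpha`, the `margin`, the number of initial noise frames, and the update rule itself are all assumptions made for illustration; the adaptive window mentioned in the abstract is not modelled.

```python
import math
import random


def box_counting_dimension(frame, scales=(4, 8, 16, 32)):
    """Estimate the box-counting dimension of one frame of samples.

    The frame is mapped onto the unit square and covered with k x k grids.
    The dimension is the least-squares slope of log(box count) vs. log(k).
    The choice of scales is an assumption, not taken from the paper.
    """
    n = len(frame)
    if n < 2:
        return 1.0
    lo, hi = min(frame), max(frame)
    if hi == lo:
        return 1.0  # a flat line is an ordinary one-dimensional curve
    span = hi - lo
    xs, ys = [], []
    for k in scales:
        col_min = [k] * k    # lowest occupied row per column
        col_max = [-1] * k   # highest occupied row per column (-1 = empty)
        for i, v in enumerate(frame):
            col = min(i * k // n, k - 1)
            row = min(int((v - lo) / span * k), k - 1)
            col_min[col] = min(col_min[col], row)
            col_max[col] = max(col_max[col], row)
        # Boxes touched in a column: every row between its min and max.
        count = sum(mx - mn + 1 for mn, mx in zip(col_min, col_max) if mx >= 0)
        xs.append(math.log(k))
        ys.append(math.log(count))
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


def detect_speech(frames, init_noise_frames=5, alpha=0.9, margin=0.2):
    """Label each frame True (speech) or False (pause).

    Hypothetical rule: a frame is speech when its dimension deviates from a
    running background estimate by more than `margin`; the estimate is
    refreshed only on frames judged to be pauses.
    """
    dims = [box_counting_dimension(f) for f in frames]
    head = dims[:init_noise_frames]
    noise_level = sum(head) / len(head)  # assumes the recording starts with a pause
    labels = []
    for d in dims:
        is_speech = abs(d - noise_level) > margin
        if not is_speech:
            # Exponential smoothing lets the threshold track slow noise drift.
            noise_level = alpha * noise_level + (1 - alpha) * d
        labels.append(is_speech)
    return labels


if __name__ == "__main__":
    rng = random.Random(0)
    noise = [[rng.uniform(-1, 1) for _ in range(256)] for _ in range(6)]
    smooth = [math.sin(2 * math.pi * i / 256) for i in range(256)]
    print(detect_speech(noise + [smooth, smooth]))
```

In this sketch a smooth waveform yields a dimension close to 1, while broadband noise yields a value closer to 2, which is what makes the per-frame dimension usable as a speech/pause feature. Updating the background estimate only on pause frames keeps speech from pulling the threshold toward itself.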