• DocumentCode
    2451810
  • Title

    Research on voice activity detection in burst and partial duration noisy environment

  • Author

    Guo, Chunyi ; Li, Runzhi ; Fan, Ming ; Liu, Kejun

  • Author_Institution
    Sch. of Inf. Eng., Zhengzhou Univ., Zhengzhou, China
  • fYear
    2012
  • fDate
    16-18 July 2012
  • Firstpage
    991
  • Lastpage
    995
  • Abstract
    Voice activity detection aims at detecting speech in noisy environment and is very important for speech recognition. In this paper, two novel methods are proposed for finding out burst and partial duration noisy signals in order to detect real speech from non-stationary noise and to improve the performance of continuous Mandarin speech recognition system. For the burst and low-energy noise adjacent to speech segment, a method using initial and final part of Chinese syllable is applied to detect accurate endpoints of speech based on traditional short-time energy and zero-crossing rates. For the isolated noise with relatively high energy, a short-time energy zero product based method is used. Both of the methods use time-domain features and have low computational complexity. Experimental results show that the system using proposed methods has improved accuracy in voice activity detection.
  • Keywords
    computational complexity; feature extraction; natural language processing; signal detection; speech recognition; time-domain analysis; Chinese syllable; burst duration noisy environment; computational complexity; continuous Mandarin speech recognition system; nonstationary noise; partial duration noisy environment; short-time energy zero product based method; speech endpoint detection; time-domain features; voice activity detection; zero-crossing rates; Hidden Markov models; Noise measurement; Signal to noise ratio; Speech; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2012 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0173-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2012.6376759
  • Filename
    6376759