• DocumentCode
    2880529
  • Title

    A robust endpoint detection of speech for noisy environments with application to automatic speech recognition

  • Author

    Bou-Ghazale, Sahar E. ; Assaleh, Khaled

  • Author_Institution
    Conexant Systems Inc., Wireless Communications Division, 4311 Jamboree Road, K02-851, Newport Beach, CA 92660, USA
  • Volume
    4
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    We propose a new approach for classifying speech vs. non-speech, which proves to significantly improve speech recognition performance under noise. The proposed algorithm relies on the energy and spectral characteristics of the signal and applies a 3-level two-dimensional thresholding to determine whether an input frame is speech or non-speech. The algorithm runs in real-time, and offers better immunity to background noise, and to background speech than traditional energy-based word boundary detection. The performance of the endpoint detector is reported here in terms of improvements in speaker-independent (SI) and speaker-dependent (SD) recognition performance using 5 different simulated noise conditions and various signal-to-noise ratios (SNR). The proposed endpoint detection of speech improves the SD recognition accuracy by 24% for office noise, and reduces the false rejection rates for both SI and SD by 45% for babble noise and lobby noise.
  • Keywords
    Cepstral analysis; Ferroelectric films; Noise; Noise measurement; Nonvolatile memory; Random access memory; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5745486
  • Filename
    5745486