• DocumentCode
    337007
  • Title

    A robust speech detection algorithm for speech activated hands-free applications

  • Author

    Wu, D. ; Tanaka, Mitsuru ; Chen, R. ; Olorenshaw, L. ; Amador, M. ; Menendez-Pidal, X.

  • Author_Institution
    Dept. of Spoken Language Technol., Sony US Res. Labs., San Jose, CA, USA
  • Volume
    4
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    2407
  • Abstract
    This paper describes a novel noise robust speech detection algorithm that can operate reliably in severe noisy car conditions. High performance has been obtained with the following techniques: (1) noise suppression based on principal component analysis for pre-processing, (2) robust endpoint detection using dynamic parameters, and (3) speech verification using the periodicity of voiced signals with harmonic enhancement. Noise suppression improves the SNR as compared with nonlinear spectrum subtraction by about 20 dB. This makes the endpoint detection operate reliably in SNRs down to -10 dB. In car environments, road bump noises are problematic for speech detectors causing mis-detection errors. Speech verification helps to remove these errors. This technology is being used in Sony car navigation products
  • Keywords
    acoustic signal detection; automobiles; cellular radio; harmonic analysis; navigation; noise abatement; principal component analysis; speech recognition; telephone sets; SNR; Sony car navigation products; cellular telephones; dynamic parameters; harmonic enhancement; mis-detection errors; noise robust speech detection algorithm; noise suppression; noisy car conditions; nonlinear spectrum subtraction; pre-processing; principal component analysis; road bump noise; robust endpoint detection; speech activated hands-free applications; speech detectors; speech recognition; speech verification; voiced signals periodicity; Detection algorithms; Detectors; Navigation; Noise robustness; Principal component analysis; Roads; Signal to noise ratio; Speech analysis; Speech enhancement; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758424
  • Filename
    758424