• DocumentCode
    2310998
  • Title

    Robust Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction

  • Author

    Yamamoto, Koichi ; Jabloun, Firas ; Reinhard, Klaus ; Kawamura, Akinori

  • Author_Institution
    Multimedia Lab., Toshiba Corp., Tokyo
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Accurate endpoint detection is important for improving the speech recognition capability. This paper proposes a novel endpoint detection method which combines energy-based and likelihood ratio-based voice activity detection (VAD) criteria, where the likelihood ratio is calculated with speech/non-speech Gaussian mixture models (GMMs). Moreover, the proposed method introduces the discriminative feature extraction technique (DFE) in order to improve the speech/non-speech classification. The DFE is used in the training of parameters required for calculating the likelihood ratio. Experimental results have shown that the proposed endpointer achieves good performance compared to an energy-based endpointer in terms of start-of-speech (SOS) and end-of-speech (EOS) detections. Due to the improvement of the endpointer, the performance of automatic speech recognition (ASR) has also been improved
  • Keywords
    Gaussian processes; feature extraction; speech recognition; automatic speech recognition; discriminative feature extraction; end-of-speech detections; nonspeech Gaussian mixture models; nonspeech classification; robust endpoint detection; start-of-speech detections; voice activity detection; Automatic speech recognition; Feature extraction; Laboratories; Linear discriminant analysis; Noise level; Noise robustness; Research and development; Speech recognition; Tellurium; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660143
  • Filename
    1660143