• DocumentCode
    950049
  • Title
    Nonspeech segment rejection based on prosodic information for robust speech recognition
  • Author
    Tian, Ye; Wang, Zuoying; Lu, Dajin
  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • Volume
    9
  • Issue
    11
  • fYear
    2002
  • Firstpage
    364
  • Lastpage
    367
  • Abstract
    A new scheme for nonspeech rejection is proposed by considering that most nonspeech segments do not have well-defined prosodic structures as speech segments do. Certain parameters characterizing the smoothness of the peak index series and of the peak amplitude series of the normalized autocorrelation function are used to make nonspeech segment rejection decisions. The receiver-operating-characteristics curve and recognition word-error-rate reduction measures show that our approach is more effective than garbage-model-based schemes when used in telephone speech recognition.
  • Keywords
    correlation methods; speech recognition; nonspeech segment rejection; normalized autocorrelation function; peak amplitude series; peak index series; prosodic information; receiver-operating-characteristics curve; robust speech recognition; telephone speech recognition; word-error-rate reduction measures; Autocorrelation; Computational efficiency; Robustness; Speech enhancement; Speech processing; Speech recognition; Speech synthesis; Telephony; Testing; Training data
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    IEEE
  • ISSN
    1070-9908
  • Type
    jour
  • DOI
    10.1109/LSP.2002.804564
  • Filename
    1058206
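
The idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not give the paper's exact parameters, so the frame length, hop, pitch-lag range, the roughness measure (mean absolute first difference), the thresholds, and all function names below are assumptions chosen only for illustration. The sketch computes, per frame, the lag and height of the largest peak of the normalized autocorrelation function, then checks how smoothly those two series evolve across frames.

```python
import numpy as np


def normalized_autocorr(frame, max_lag):
    """Autocorrelation of one frame for lags 0..max_lag, scaled so lag 0 equals 1."""
    frame = frame - frame.mean()
    energy = float(np.dot(frame, frame))
    if energy <= 1e-12:
        # Silent frame: no usable periodicity information.
        return np.zeros(max_lag + 1)
    full = np.correlate(frame, frame, mode="full")
    mid = len(frame) - 1  # position of lag 0 in the "full" output
    return full[mid:mid + max_lag + 1] / energy


def peak_series(signal, sr=8000, frame_len=240, hop=120, f_lo=60.0, f_hi=400.0):
    """Per-frame peak lag (index) and peak height (amplitude) in an assumed pitch range."""
    min_lag = int(sr / f_hi)
    max_lag = int(sr / f_lo)
    idx, amp = [], []
    for start in range(0, len(signal) - frame_len + 1, hop):
        r = normalized_autocorr(signal[start:start + frame_len], max_lag)
        k = min_lag + int(np.argmax(r[min_lag:max_lag + 1]))
        idx.append(k)
        amp.append(r[k])
    return np.array(idx, dtype=float), np.array(amp, dtype=float)


def roughness(series):
    """Assumed smoothness measure: mean absolute first difference (0 means perfectly smooth)."""
    if len(series) < 2:
        return 0.0
    return float(np.mean(np.abs(np.diff(series))))


def is_nonspeech(signal, sr=8000, idx_thresh=10.0, amp_thresh=0.1):
    """Reject a segment when either series is rougher than its (hypothetical) threshold."""
    idx, amp = peak_series(signal, sr)
    return bool(roughness(idx) > idx_thresh or roughness(amp) > amp_thresh)
```

On a steady periodic signal the peak index series stays nearly constant, while on white noise the peak lag jumps erratically between frames, which is the contrast the decision rule relies on. A real system would tune the thresholds on labeled data, as the receiver-operating-characteristics evaluation mentioned in the abstract suggests.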