Title :
Nonspeech segment rejection based on prosodic information for robust speech recognition
Author :
Tian, Ye ; Wang, Zuoying ; Lu, Dajin
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
A new scheme for nonspeech rejection is proposed by considering that most nonspeech segments do not have well-defined prosodic structures as speech segments do. Certain parameters characterizing the smoothness of the peak index series and of the peak amplitude series of the normalized autocorrelation function are used to make nonspeech segment rejection decisions. The receiver-operating-characteristics curve and recognition word-error-rate reduction measures show that our approach is more effective than garbage-model-based schemes when used in telephone speech recognition.
Keywords :
correlation methods; speech recognition; nonspeech segment rejection; normalized autocorrelation function; peak amplitude series; peak index series; prosodic information; receiver-operating-characteristics curve; robust speech recognition; telephone speech recognition; word-error-rate reduction measures; Autocorrelation; Computational efficiency; Robustness; Speech enhancement; Speech processing; Speech recognition; Speech synthesis; Telephony; Testing; Training data;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2002.804564