DocumentCode :
950049
Title :
Nonspeech segment rejection based on prosodic information for robust speech recognition
Author :
Tian, Ye ; Wang, Zuoying ; Lu, Dajin
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume :
9
Issue :
11
fYear :
2002
Firstpage :
364
Lastpage :
367
Abstract :
A new scheme for nonspeech rejection is proposed by considering that most nonspeech segments do not have well-defined prosodic structures as speech segments do. Certain parameters characterizing the smoothness of the peak index series and of the peak amplitude series of the normalized autocorrelation function are used to make nonspeech segment rejection decisions. The receiver-operating-characteristics curve and recognition word-error-rate reduction measures show that our approach is more effective than garbage-model-based schemes when used in telephone speech recognition.
Keywords :
correlation methods; speech recognition; nonspeech segment rejection; normalized autocorrelation function; peak amplitude series; peak index series; prosodic information; receiver-operating-characteristics curve; robust speech recognition; telephone speech recognition; word-error-rate reduction measures; Autocorrelation; Computational efficiency; Robustness; Speech enhancement; Speech processing; Speech recognition; Speech synthesis; Telephony; Testing; Training data;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2002.804564
Filename :
1058206
Link To Document :
بازگشت