DocumentCode
950049
Title
Nonspeech segment rejection based on prosodic information for robust speech recognition
Author
Tian, Ye ; Wang, Zuoying ; Lu, Dajin
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume
9
Issue
11
fYear
2002
Firstpage
364
Lastpage
367
Abstract
A new scheme for nonspeech rejection is proposed by considering that most nonspeech segments do not have well-defined prosodic structures as speech segments do. Certain parameters characterizing the smoothness of the peak index series and of the peak amplitude series of the normalized autocorrelation function are used to make nonspeech segment rejection decisions. The receiver-operating-characteristics curve and recognition word-error-rate reduction measures show that our approach is more effective than garbage-model-based schemes when used in telephone speech recognition.
Keywords
correlation methods; speech recognition; nonspeech segment rejection; normalized autocorrelation function; peak amplitude series; peak index series; prosodic information; receiver-operating-characteristics curve; robust speech recognition; telephone speech recognition; word-error-rate reduction measures; Autocorrelation; Computational efficiency; Robustness; Speech enhancement; Speech processing; Speech recognition; Speech synthesis; Telephony; Testing; Training data
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
IEEE
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2002.804564
Filename
1058206