DocumentCode
807577
Title
Evaluation of speech recognizers for speech training applications
Author
Anderson, Sven ; Kewley-Port, Diane
Author_Institution
Dept. of Organismal Biol. & Anatomy, Chicago Univ., IL, USA
Volume
3
Issue
4
fYear
1995
fDate
7/1/1995 12:00:00 AM
Firstpage
229
Lastpage
241
Abstract
The use of speech recognition technology for speech training represents an important and potentially very large application of speech technology. However, speech training places unique demands on recognizer performance that have not been well-characterized. In this research, a database and testing procedures were developed to evaluate two facets of recognizer performance integral to speech training: utterance identification and speech quality assessment. Using these materials, three commercial speech recognizers that employ different types of recognition algorithms were evaluated. In general, the recognizer, based on hidden Markov models (HMM´s), provided better identification scores for normal and disordered speech than the two template-based recognizers. A recognizer´s identification performance on normal speech often predicted its identification performance on disordered speech. For each recognizer, analysis using phonological features revealed classes of speech sounds that are poorly discriminated. Procedures were developed to provide human ratings of the quality of disordered speech for comparison to recognizer performance. Recognizers were compared to speech-language pathologists with respect to the ability to judge speech quality. In contrast, with identification performance, the two speech recognizers based on template comparisons provided better measures of speech quality than the HMM-based recognizer
Keywords
hidden Markov models; speech recognition; database; disordered speech; hidden Markov models; human ratings; normal speech; phonological features; recognition algorithms; recognizer performance; speech quality; speech recognizers; speech sounds; speech technology; speech training applications; speech-language pathologists; template-based recognizers; testing procedures; utterance identification; Communications technology; Databases; Face recognition; Hidden Markov models; Humans; Natural languages; Quality assessment; Speech analysis; Speech recognition; Testing;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.397088
Filename
397088
Link To Document