Title :
A user-configurable system for voice label recognition
Author :
Rose, R.C. ; Lleida, E. ; Erhart, G.W. ; Grubbe, R.V.
Author_Institution :
AT&T Res., Murray Hill, NJ, USA
Abstract :
A set of techniques for configuring a speech recognition system to a particular user is described in the context of voice label recognition over the public switched telephone network. User-configurable vocabularies are provided through automatic acoustic baseform determination based on an inventory of speaker-independent subword acoustic units. The tendency of input utterances to contain out-of-vocabulary or non-speech information is accounted for by using likelihood ratio-based utterance verification procedures. The mismatch between a given user's utterances and the hidden Markov model is accounted for by using a frequency-warping approach to speaker normalization. The performance of these techniques was evaluated on utterances taken from a trial version of a voice label recognition service.
Keywords :
acoustics; hidden Markov models; software performance evaluation; speech recognition; telephone networks; telephony; vocabulary; automatic acoustic baseform determination; frequency warping; hidden Markov model mismatch; input utterances; likelihood ratio-based utterance verification procedures; nonspeech information; out-of-vocabulary information; performance evaluation; public switched telephone network; speaker normalization; speaker-independent subword acoustic units; speech recognition system configuration; user-configurable vocabularies; voice label recognition; Acoustic testing; Automatic speech recognition; Control systems; Frequency; Hidden Markov models; Loudspeakers; Speech analysis; Speech recognition; Telephony; Vocabulary;
Conference_Title :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607428