Title :
An embedded word training procedure for connected digit recognition
Author :
Rabiner, L.R. ; Bergh, A. ; Wilpon, J.G.
Author_Institution :
AT&T Bell Laboratories, Murray Hill, New Jersey
Abstract :
The "conventional" way of obtaining word reference patterns for connected word recognition systems is to use isolated word patterns, and to rely on the dynamics of the matching algorithm to account for the differences in connected speech. Connected word recognition, based on such an approach, tends to become unreliable (high error rates) when the talking rate becomes grossly incommensurate with the rate at which the isolated word training patterns were spoken. To alleviate this problem, an improved training procedure for connected word (digit) recognition is proposed in which word reference patterns from isolated occurrences of the vocabulary words are combined with word reference patterns extracted from within connected word strings to give a robust, reliable word recognizer over all normal speaking rates. In a test of the system (as a speaker trained, connected digit recognizer) with 18 talkers each speaking 40 different strings (of variable length from 2 to 5 digits), median string error rates of 0% and 2.5% were obtained for deliberately spoken strings and naturally spoken strings, respectively, when the string length was known. Using just isolated word training tokens, the comparable error rates were 10% and 11.3% respectively.
Keywords :
Concatenated codes; Error analysis; Fasteners; Pattern matching; Pattern recognition; Robustness; Speech recognition; System testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
DOI :
10.1109/ICASSP.1982.1171810