DocumentCode :
1093891
Title :
A training procedure for isolated word recognition systems
Author :
Furui, Sadaoki
Author_Institution :
Nippon Telegraph and Telephone Public Corporation, Tokyo, Japan
Volume :
28
Issue :
2
fYear :
1980
fDate :
4/1/1980 12:00:00 AM
Firstpage :
129
Lastpage :
136
Abstract :
A procedure has been devised to reduce the amount of training required for a phoneme-based speaker-dependent word recognition system and still maintain performance. Each new speaker is required to provide utterances of only a fraction of the entire vocabulary as a training set. A set of transformation rules is used to estimate phoneme templates for the entire vocabulary from phoneme templates included in the training. The transformation rules are obtained in a pretraining procedure in which a group of speakers provides utterances of the entire vocabulary and multiple regression analysis (MRA) is used to obtain linear estimates of the entire phoneme template set in terms of the set designated as training templates. This group of speakers is generally distinct from the group of training speakers. Thus, since the transformation rules are established independent of the training speakers, the entire procedure can be considered a hybrid speaker-dependent/ speaker-independent system. Results of recognition experiments using spoken digits uttered by 30 male and female speakers and 67 airport names uttered by 30 male speakers have ascertained the effectiveness of this training procedure. A mean recognition accuracy of 98.2 percent was obtained for the latter utterance set after a 12-word training procedure.
Keywords :
Acoustic measurements; Airports; Automatic speech recognition; Regression analysis; Speech processing; Speech recognition; System testing; Telegraphy; Telephony; Vocabulary;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1980.1163393
Filename :
1163393
Link To Document :
بازگشت