DocumentCode
3012899
Title
Improvement of word recognition results by trigram model
Author
Shikano, Kiyohiro
Author_Institution
ATR Interpreting Telephony Research Laboratories, Osaka, Japan
Volume
12
fYear
1987
fDate
31868
Firstpage
1261
Lastpage
1264
Abstract
A trigram language model based on word categories is introduced in order to improve word recognition results by use of linguistic information. A trigram model based on word sequences requires a lot of memory and training samples to store and estimate its probabilities. To avoid these almost unsolvable problems, a trigram model of words whose probabilities are estimated from the trigram of categories and word occurrence probabilities in the dictionary is introduced. The probabilities of the trigram of categories and the word probabilities in the dictionary are estimated using the Brown Corpus Text Database[1]. This trigram model is efficiently applied to improve word recognition results using a dynamic programming technique. Moreover, probabilities of special word sequences (frozen word sequences) are extracted from the Brown Corpus Text Database and these probabilities are also integrated in the dynamic programming algorithm. Word recognition through speaker adaptation is carried out using three input speakers from the IBM office correspondence task database[3]. The word recognition rate was 80.9%. The trigram model improves the word recognition rate to 89.1%.
Keywords
Australia; Databases; Dictionaries; Dynamic programming; Heuristic algorithms; Ice; Laboratories; Linear predictive coding; Poles and towers; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type
conf
DOI
10.1109/ICASSP.1987.1169447
Filename
1169447
Link To Document