DocumentCode
394196
Title
Confidence of agreement among multiple LVCSR models and model combination by SVM
Author
Utsuro, Tukehito ; Yasuhiro Kodama ; Tomohiro Watanabel ; Nishizaki, Hiromitsu ; Seiichi Nakagawa
Author_Institution
Dpt. Intelligence Sci. & Technol., Kyoto Univ., Japan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
For many practical applications of speech recognition systems, it is quite desirable to have an estimate of confidence for each hypothesized word. Unlike previous works on confidence measures, we have proposed features for confidence measures that are extracted from outputs of more than one LVCSR models. For further analysis of the proposed confidence measure, this paper examines the correlation between each word´s confidence and the word´s features such as its part-of-speech and syllable length. We then apply SVM learning technique to the task of combining outputs of multiple LVCSR models, where, as features of SVM learning, information such as the pairs of the models which output the hypothesized word are useful for improving the word recognition rate. Experimental results show that the combination results achieve a relative word error reduction of up to 72 % against the best performing single model and that of up to 36 % against ROVER.
Keywords
correlation methods; learning (artificial intelligence); learning automata; natural languages; speech recognition; ROVER; SVM; SVM learning; agreement confidence; confidence estimation; confidence measures; correlation; language models; model combination; multiple LVCSR models; part-of-speech; recognizer output voting error reduction; single model; speech recognition systems; support vector machine; syllable length; word confidence; word error reduction; word features; word recognition rate; Acoustic measurements; Application software; Automatic speech recognition; Data mining; Informatics; Length measurement; Machine learning; Speech recognition; Support vector machines; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198705
Filename
1198705
Link To Document