DocumentCode :
2800600
Title :
Speech modeling based on committee-based active learning
Author :
Hamanaka, Yuzo ; Shinoda, Koichi ; Furui, Sadaoki ; Emori, Tadashi ; Koshinaka, Takafumi
Author_Institution :
Tokyo Inst. of Technol., Tokyo, Japan
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4350
Lastpage :
4353
Abstract :
We propose a committee-based active learning method for large vocabulary continuous speech recognition. In this approach, multiple recognizers are prepared beforehand, and the recognition results obtained from them are used for selecting utterances. Here, a progressive search method is used for aligning sentences, and voting entropy is used as a measure for selecting utterances. We apply our method not only to acoustic models but also to language models and their combination. Our method was evaluated by using 190-hour speech data in the Corpus of Spontaneous Japanese. It proved to be significantly better than random selection. It only required 63 h of data to achieve a word accuracy of 74%, while standard training (i.e., random selection) required 97 h of data. The recognition accuracy of our proposed method was also better than that of the conventional uncertainty sampling method using word posterior probabilities as the confidence measure for selecting sentences.
Keywords :
learning (artificial intelligence); search problems; speech recognition; vocabulary; Corpus of Spontaneous Japanese; acoustic models; committee-based active learning; confidence measure; language models; multiple recognizers; recognition accuracy; search method; sentence aligning; speech data; speech modeling; uncertainty sampling method; utterances; vocabulary continuous speech recognition; voting entropy; word accuracy; word posterior probabilities; Acoustic measurements; Entropy; Learning systems; Natural languages; Sampling methods; Search methods; Speech analysis; Speech recognition; Vocabulary; Voting; acoustic model; active learning; language model; progressive search; voting entropy;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495650
Filename :
5495650