Regional Information Center for Science and Technology (RICEST) - Speech modeling based on committee-based active learning

DocumentCode :

2800600

Title :

Speech modeling based on committee-based active learning

Author :

Hamanaka, Yuzo ; Shinoda, Koichi ; Furui, Sadaoki ; Emori, Tadashi ; Koshinaka, Takafumi

Author_Institution :

Tokyo Inst. of Technol., Tokyo, Japan

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4350

Lastpage :

4353

Abstract :

We propose a committee-based active learning method for large vocabulary continuous speech recognition. In this approach, multiple recognizers are prepared beforehand, and the recognition results obtained from them are used for selecting utterances. Here, a progressive search method is used for aligning sentences, and voting entropy is used as a measure for selecting utterances. We apply our method not only to acoustic models but also to language models and their combination. Our method was evaluated by using 190-hour speech data in the Corpus of Spontaneous Japanese. It proved to be significantly better than random selection. It only required 63 h of data to achieve a word accuracy of 74%, while standard training (i.e., random selection) required 97 h of data. The recognition accuracy of our proposed method was also better than that of the conventional uncertainty sampling method using word posterior probabilities as the confidence measure for selecting sentences.
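
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the committee's hypotheses for each utterance have already been aligned word by word into non-empty, equal-length lists (the paper uses a progressive search for that alignment, which is not reproduced here), it uses plain unnormalized vote entropy averaged over positions, and the function names `vote_entropy`, `utterance_score`, and `select_utterances` are hypothetical.

```python
import math
from collections import Counter


def vote_entropy(votes):
    """Entropy (in nats) of the committee's votes at one aligned position."""
    k = len(votes)
    counts = Counter(votes)
    return -sum((c / k) * math.log(c / k) for c in counts.values())


def utterance_score(aligned_hyps):
    """Mean vote entropy over aligned positions.

    aligned_hyps: one word list per committee member, all of the same
    non-empty length (assumed to be pre-aligned).
    """
    positions = list(zip(*aligned_hyps))
    return sum(vote_entropy(p) for p in positions) / len(positions)


def select_utterances(pool, n):
    """Return the n utterance ids on which the committee disagrees most.

    pool: dict mapping utterance id -> aligned hypotheses.
    """
    ranked = sorted(pool, key=lambda u: utterance_score(pool[u]), reverse=True)
    return ranked[:n]
```

Utterances on which all recognizers agree score zero and are selected last; those with the most disagreement are selected first for transcription and training.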

Keywords :

learning (artificial intelligence); search problems; speech recognition; vocabulary; Corpus of Spontaneous Japanese; acoustic models; committee-based active learning; confidence measure; language models; multiple recognizers; recognition accuracy; search method; sentence aligning; speech data; speech modeling; uncertainty sampling method; utterances; vocabulary continuous speech recognition; voting entropy; word accuracy; word posterior probabilities; Acoustic measurements; Entropy; Learning systems; Natural languages; Sampling methods; Search methods; Speech analysis; Speech recognition; Vocabulary; Voting; acoustic model; active learning; language model; progressive search; voting entropy;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495650

Filename :

5495650

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2800600