مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervisec cross-validation adaptation algorithms for improved adaptation performance

DocumentCode :

3529449

Title :

Unsupervisec cross-validation adaptation algorithms for improved adaptation performance

Author :

Shinozaki, Takahiro ; Kubota, Yu ; Furui, Sadaoki

Author_Institution :

Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

4377

Lastpage :

4380

Abstract :

An unsupervised cross-validation adaptation algorithm and its variation are proposed that introduce the idea of cross-validation in the unsupervised batch-mode adaptation framework to improve the adaptation performance. The first algorithm is constructed on a general adaptation technique such as MLLR and can be used in combination with any adaptation method. The second algorithm is a modified version of the first algorithm and works with lower computational cost by assuming MLLR. These algorithms are extensions of our previously proposed CV training methods and are useful to suppress the negative effect of the conventional unsupervised batch-mode adaptation process that reinforces the errors included in automatic transcriptions. The proposed algorithms were evaluated in domain adaptation, speaker adaptation, and in their combination for large vocabulary spontaneous speech recognition. When the domain and speaker adaptations were combined using a read speech initial model, the relative word error rate reduction by the proposed method was 29% whereas the reduction by the conventional approach was 23%.

Keywords :

maximum likelihood estimation; regression analysis; speaker recognition; vocabulary; CV training; adaptation performance; domain adaptation; maximum likelihood linear regression; read speech initial model; speaker adaptation; unsupervised batch-mode adaptation; unsupervised cross-validation adaptation; vocabulary spontaneous speech recognition; word error rate reduction; Computational efficiency; Context modeling; Error correction; Iterative algorithms; Iterative decoding; Maximum likelihood linear regression; Parameter estimation; Speech analysis; Speech recognition; Vocabulary; MAP; MLLR; Unsupervised adaptation; computational cost; cross-validation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4960599

Filename :

4960599

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3529449