DocumentCode :
3529449
Title :
Unsupervisec cross-validation adaptation algorithms for improved adaptation performance
Author :
Shinozaki, Takahiro ; Kubota, Yu ; Furui, Sadaoki
Author_Institution :
Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4377
Lastpage :
4380
Abstract :
An unsupervised cross-validation adaptation algorithm and its variation are proposed that introduce the idea of cross-validation in the unsupervised batch-mode adaptation framework to improve the adaptation performance. The first algorithm is constructed on a general adaptation technique such as MLLR and can be used in combination with any adaptation method. The second algorithm is a modified version of the first algorithm and works with lower computational cost by assuming MLLR. These algorithms are extensions of our previously proposed CV training methods and are useful to suppress the negative effect of the conventional unsupervised batch-mode adaptation process that reinforces the errors included in automatic transcriptions. The proposed algorithms were evaluated in domain adaptation, speaker adaptation, and in their combination for large vocabulary spontaneous speech recognition. When the domain and speaker adaptations were combined using a read speech initial model, the relative word error rate reduction by the proposed method was 29% whereas the reduction by the conventional approach was 23%.
Keywords :
maximum likelihood estimation; regression analysis; speaker recognition; vocabulary; CV training; adaptation performance; domain adaptation; maximum likelihood linear regression; read speech initial model; speaker adaptation; unsupervised batch-mode adaptation; unsupervised cross-validation adaptation; vocabulary spontaneous speech recognition; word error rate reduction; Computational efficiency; Context modeling; Error correction; Iterative algorithms; Iterative decoding; Maximum likelihood linear regression; Parameter estimation; Speech analysis; Speech recognition; Vocabulary; MAP; MLLR; Unsupervised adaptation; computational cost; cross-validation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960599
Filename :
4960599
Link To Document :
بازگشت