DocumentCode
542263
Title
Speaker selection training for large vocabulary continuous speech recognition
Author
Huang, Chao ; Chen, Tao ; Chang, Eric
Author_Institution
Microsoft Research Asia, 5F, Sigma Center, No. 49, Zhichun Road, Beijing 100080, China
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
Acoustic variability across speakers is one of the challenges of speaker independent (SI) speech recognition systems. As a powerful solution, dominant speaker adaptation technologies such as MLLR and MAP may become inefficient because of the lack of enough enrollment data. In this paper, we propose an adaptation method based on speaker selection training, which makes full use of statistics of training corpus. Relative error rate reduction of 5.31 % is achieved when only one utterance is available. We compare different speaker selection strategies, namely. PCA, HMM and GMM based methods. In addition, impacts of number of selected cohort speakers and number of utterances from target speaker are investigated. Furthermore, comparison and integration with MLLR adaptation are also shown. Finally, some ongoing work such as dynamicalJy varying number of selected speakers, measuring the relative contribution among the selected speakers and speeding up the computationally expensive procedure of re-estimation with model synthesis are also discussed.
Keywords
Adaptation model; Chaos; Data models; Hidden Markov models; Robustness; Speech recognition; Thyristors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743791
Filename
5743791
Link To Document