Title :
Inter-class MLLR for speaker adaptation
Author :
Doh, Sam-Joo ; Stern, Richard M.
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
This paper examines the use of interdependencies of parameter classes in transformation-based speaker adaptation algorithms such as maximum likelihood linear regression (MLLR). In transformation-based adaptation, increasing the number of transformation classes can provide more detailed information for adaptation, but at the expense of greater estimation error with small amounts of data. In this paper we introduce a new procedure, inter-class MLLR, which utilizes the relationship between different classes to achieve both detailed and reliable transformation-based adaptation using limited data, In this method, the inter-class relation is given by a linear regression which is estimated from training data. In experiments using non-native English speakers from the Spoke 3 data in the 1994 DARPA Wall Street Journal evaluation, inter-class MLLR provided a relative reduction in word error rates of 11.3% compared to conventional MLLR
Keywords :
maximum likelihood estimation; speech recognition; transforms; 1994 DARPA Wall Street Journal evaluation; Spoke 3 data; estimation error; inter-class MLLR; interdependencies; maximum likelihood linear regression; nonnative English speakers; parameter classes; speaker adaptation; transformation-based speaker adaptation algorithms; word error rates; Computer science; Error analysis; Estimation error; Linear regression; Maximum likelihood linear regression; Parameter estimation; Regression tree analysis; Speech recognition; System testing; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.861957