DocumentCode
353623
Title
Inter-class MLLR for speaker adaptation
Author
Doh, Sam-Joo ; Stern, Richard M.
Author_Institution
Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
3
fYear
2000
fDate
2000
Firstpage
1543
Abstract
This paper examines the use of interdependencies of parameter classes in transformation-based speaker adaptation algorithms such as maximum likelihood linear regression (MLLR). In transformation-based adaptation, increasing the number of transformation classes can provide more detailed information for adaptation, but at the expense of greater estimation error with small amounts of data. In this paper we introduce a new procedure, inter-class MLLR, which utilizes the relationship between different classes to achieve both detailed and reliable transformation-based adaptation using limited data, In this method, the inter-class relation is given by a linear regression which is estimated from training data. In experiments using non-native English speakers from the Spoke 3 data in the 1994 DARPA Wall Street Journal evaluation, inter-class MLLR provided a relative reduction in word error rates of 11.3% compared to conventional MLLR
Keywords
maximum likelihood estimation; speech recognition; transforms; 1994 DARPA Wall Street Journal evaluation; Spoke 3 data; estimation error; inter-class MLLR; interdependencies; maximum likelihood linear regression; nonnative English speakers; parameter classes; speaker adaptation; transformation-based speaker adaptation algorithms; word error rates; Computer science; Error analysis; Estimation error; Linear regression; Maximum likelihood linear regression; Parameter estimation; Regression tree analysis; Speech recognition; System testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.861957
Filename
861957
Link To Document