DocumentCode
352332
Title
Speaker adaptation based on combination of MAP estimation and weighted neighbor regression
Author
He, Lei ; Wu, Jian ; Fang, Ditang ; Wu, Wenhu
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume
2
fYear
2000
fDate
2000
Abstract
This paper describes a novel speaker adaptation method that combines the maximum a posteriori (MAP) estimation and the weighted neighbor regression (WNR). The primary disadvantage of MAP adaptation is that only the parameters of those models with adaptation data are updated, thus great deals of adaptation data are required. In this paper, a technique called WNR is presented, in which the information of model neighbors is used to overcome that problem. The parameter relationships between the speaker independent models and the speaker adaptation models are trained by applying the distance weighted regressions to a set of neighbor model parameters with and without MAP adaptation. It gives nearly 15 percent error rate reduction with 10 adaptation utterances and more than 51 percent with 250 utterances in Chinese syllable recognition. In addition, the vector field smoothing (VFS) can be proved to be a degenerate case of WNR
Keywords
maximum likelihood estimation; speech recognition; Chinese syllable recognition; MAP estimation; WNR; distance weighted regressions; error rate reduction; maximum a posteriori estimation; parameter relationships; speaker adaptation; speaker independent models; vector field smoothing; weighted neighbor regression; Adaptation model; Automatic speech recognition; Bayesian methods; Computer science; Convergence; Error analysis; Helium; Laboratories; Smoothing methods; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.859126
Filename
859126
Link To Document