Speaker adaptation based on combination of MAP estimation and weighted neighbor regression

Author

He, Lei ; Wu, Jian ; Fang, Ditang ; Wu, Wenhu

Author_Institution

Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China

Volume

2

fYear

2000

fDate

2000

Abstract

This paper describes a novel speaker adaptation method that combines the maximum a posteriori (MAP) estimation and the weighted neighbor regression (WNR). The primary disadvantage of MAP adaptation is that only the parameters of those models with adaptation data are updated, thus great deals of adaptation data are required. In this paper, a technique called WNR is presented, in which the information of model neighbors is used to overcome that problem. The parameter relationships between the speaker independent models and the speaker adaptation models are trained by applying the distance weighted regressions to a set of neighbor model parameters with and without MAP adaptation. It gives nearly 15 percent error rate reduction with 10 adaptation utterances and more than 51 percent with 250 utterances in Chinese syllable recognition. In addition, the vector field smoothing (VFS) can be proved to be a degenerate case of WNR

Keywords

maximum likelihood estimation; speech recognition; Chinese syllable recognition; MAP estimation; WNR; distance weighted regressions; error rate reduction; maximum a posteriori estimation; parameter relationships; speaker adaptation; speaker independent models; vector field smoothing; weighted neighbor regression; Adaptation model; Automatic speech recognition; Bayesian methods; Computer science; Convergence; Error analysis; Helium; Laboratories; Smoothing methods; System testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location

Istanbul

ISSN

1520-6149

Print_ISBN

0-7803-6293-4

Type

conf

DOI

10.1109/ICASSP.2000.859126

Filename

859126