مرکز منطقه ای اطلاع رساني علوم و فناوري - Very fast adaptation with a compact context-dependent eigenvoice model

DocumentCode :

1749674

Title :

Very fast adaptation with a compact context-dependent eigenvoice model

Author :

Kuhn, R. ; Perronnin, E. ; Nguyen, P. ; Junqua, J.C. ; Rigazio, L.

Author_Institution :

Panasonic Speech Technol. Lab., Panasonic Technol. Inc, Santa Barbara, CA, USA

Volume :

fYear :

2001

fDate :

2001

Firstpage :

373

Abstract :

The "eigenvoice" technique achieves rapid speaker adaptation by employing prior knowledge of speaker space obtained from reference speakers to place strong constraints on the initial model for each new speaker. It has previously been shown to yield very fast adaptation for a large-vocabulary system. In this paper, we describe a new way of applying the eigenvoice technique to context-dependent acoustic modeling, called the "eigencentroid plus delta trees" (EDT) model. Here, the context-dependent model is defined so that it consists of a speaker-dependent component with a small number of parameters linked to a speaker-independent component with far more parameters. The eigenvoice technique can then be applied to the speaker-dependent component alone to attain very fast adaptation of the entire context-dependent model (e.g., 10% relative reduction in error rate after 3 sentences). EDT requires only a small number of parameters to represent speaker space and works even if only a small amount of data is available per reference speaker

Keywords :

eigenvalues and eigenfunctions; speech recognition; trees (mathematics); EDT model; compact context-dependent eigenvoice model; context-dependent acoustic modeling; eigencentroid plus delta trees; speaker adaptation; speaker space; speaker-dependent component; speaker-independent component; speech recognition; very fast adaptation; Context modeling; Error analysis; Hidden Markov models; Laboratories; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Space technology; Speech recognition; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location :

Salt Lake City, UT

ISSN :

1520-6149

Print_ISBN :

0-7803-7041-4

Type :

conf

DOI :

10.1109/ICASSP.2001.940845

Filename :

940845

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1749674