مرکز منطقه ای اطلاع رساني علوم و فناوري - Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data

DocumentCode :

1051917

Title :

Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data

Author :

Tang, Yun ; Rose, Richard

Author_Institution :

McGill Univ., Montreal

Volume :

Issue :

fYear :

2008

fDate :

3/1/2008 12:00:00 AM

Firstpage :

607

Lastpage :

616

Abstract :

Speaker space-based adaptation methods for automatic speech recognition have been shown to provide significant performance improvements for tasks where only a few seconds of adaptation speech is available. However, these techniques are not widely used in practical applications because they require large amounts of speaker-dependent training data and large amounts of computer memory. The authors propose a robust, low-complexity technique within this general class that has been shown to reduce word error rate, reduce the large storage requirements associated with speaker space approaches, and eliminate the need for large numbers of utterances per speaker in training. The technique is based on representing speakers as a linear combination of clustered linear basis vectors and a procedure is presented for maximum-likelihood estimation of these vectors from training data. Significant word error rate reduction was obtained using these methods relative to speaker independent performance for the Resource Management and Wall Street Journal task domains.

Keywords :

maximum likelihood estimation; speaker recognition; Resource Management and Wall Street Journal; automatic speech recognition; clustered linear basis vectors; maximum-likelihood estimation; sparse training data; speaker space-based adaptation; Cluster adaptive training; eigenvoices; parameter tying; speaker adaptation; speech recognition;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.916530

Filename :

4443889

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1051917