Regional Information Center for Science and Technology - Segmental eigenvoice with delicate eigenspace for improved speaker adaptation

DocumentCode :

774799

Title :

Segmental eigenvoice with delicate eigenspace for improved speaker adaptation

Author :

Tsao, Yu ; Lee, Shang-Ming ; Lee, Lin-shan

Author_Institution :

Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan

Volume :

Issue :

fYear :

2005

fDate :

5/1/2005

Firstpage :

399

Lastpage :

411

Abstract :

Eigenvoice techniques have been proposed to provide rapid speaker adaptation with very limited adaptation data, but their performance may saturate as more adaptation data become available. This is because these techniques establish an eigenspace of reduced dimensionality by exploiting a priori knowledge from a large quantity of training data. The reduced dimensionality of the eigenspace requires less adaptation data to estimate the model parameters for a new speaker, but also makes it harder to obtain more precise models when more adaptation data are available. In this paper, a new segmental eigenvoice approach is proposed, in which the eigenspace is further segmented into N subeigenspaces by classifying the model parameters into N clusters. These N subeigenspaces help to construct a more delicate eigenspace and more precise models when more adaptation data are available. It is shown that there can be at least mixture-based, model-based, and feature-based segmental eigenvoice approaches. Not only can improved performance be obtained, but these different approaches can also be integrated to offer better performance. Two further approaches leading to improved segmental eigenvoice techniques with even better performance are also proposed. The experiments were performed on both a large-vocabulary and a small-vocabulary recognition task.
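The idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes NumPy, uses random vectors in place of real speaker supervectors, replaces maximum-likelihood weight estimation from adaptation data with a plain least-squares projection, and splits the parameters into N contiguous index blocks instead of the mixture-, model-, or feature-based clustering the paper describes. All function names (`pca_basis`, `project`, `segmental_project`) are hypothetical.

```python
import numpy as np

def pca_basis(X, k):
    """Return (mean, basis); basis rows are the top-k principal directions of X."""
    mean = X.mean(axis=0)
    # SVD of the centred data; rows of vt are orthonormal principal directions.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:k]

def project(x, mean, basis):
    """Least-squares reconstruction of x in the affine subspace mean + span(basis)."""
    w = basis @ (x - mean)      # k weights; valid because basis rows are orthonormal
    return mean + w @ basis

def segmental_project(x, X_train, segments, k):
    """Fit one sub-eigenspace per index segment and reconstruct x segment by segment."""
    out = np.empty_like(x)
    for idx in segments:
        mean, basis = pca_basis(X_train[:, idx], k)
        out[idx] = project(x[idx], mean, basis)
    return out

rng = np.random.default_rng(0)
X_train = rng.normal(size=(50, 12))   # toy data: 50 training speakers, 12-dim supervectors
x_new = rng.normal(size=12)           # toy "true" supervector of a new speaker
k = 3                                 # eigenvoices kept per (sub)eigenspace

# Single eigenspace: k weights describe the whole supervector.
mean, basis = pca_basis(X_train, k)
global_fit = project(x_new, mean, basis)

# Segmental variant: N = 3 sub-eigenspaces, so N * k weights in total.
segments = np.array_split(np.arange(12), 3)   # contiguous split is an assumption
seg_fit = segmental_project(x_new, X_train, segments, k)

print("weights to estimate:", k, "vs", len(segments) * k)
print("squared error (single eigenspace):", float(np.sum((x_new - global_fit) ** 2)))
print("squared error (segmental):", float(np.sum((x_new - seg_fit) ** 2)))
```

The sketch makes the trade-off concrete: the segmental model has N times as many weights to estimate, which is why it needs more adaptation data but can represent a new speaker more finely.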

Keywords :

eigenvalues and eigenfunctions; speaker recognition; adaptation data; delicate eigenspace; segmental eigenvoice technique; speaker adaptation; training data; vocabulary recognition task; Acoustic testing; Adaptation model; Automatic speech recognition; Helium; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Principal component analysis; Vocabulary; Eigenvector approach

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2005.845819

Filename :

1420374

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=774799