DocumentCode :
774799
Title :
Segmental eigenvoice with delicate eigenspace for improved speaker adaptation
Author :
Tsao, Yu ; Lee, Shang-Ming ; Lee, Lin-shan
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
13
Issue :
3
fYear :
2005
fDate :
5/1/2005 12:00:00 AM
Firstpage :
399
Lastpage :
411
Abstract :
Eigenvoice techniques have been proposed to provide rapid speaker adaptation with very limited adaptation data, but the performance may be saturated when more adaptation data become available. This is because in these techniques an eigenspace with reduced dimensionality is established by properly utilizing the a priori knowledge from the large quantity of training data. The reduced dimensionality of the eigenspace requires less adaptation data to estimate the model parameters for the new speaker, but also makes it less easy to obtain more precise models with more adaptation data. In this paper, a new segmental eigenvoice approach is proposed, in which the eigenspace can be further segmented into N subeigenspaces by properly classifying the model parameters into N clusters. These N subeigenspaces can help to construct a more delicate eigenspace and more precise models when more adaptation data are available. It will be shown that there can be at least mixture-based, model-based and feature-based segmental eigenvoice approaches. Not only improved performance can be obtained, but these different approaches can be properly integrated to offer better performance. Two further approaches leading to improved segmental eigenvoice techniques with even better performance are also proposed. The experiments were performed with both a large vocabulary and a small vocabulary recognition tasks.
Keywords :
eigenvalues and eigenfunctions; speaker recognition; adaptation data; delicate eigenspace; segmental eigenvoice technique; speaker adaptation; training data; vocabulary recognition task; Acoustic testing; Adaptation model; Automatic speech recognition; Helium; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Principal component analysis; Training data; Vocabulary; Eigenvector approach; principal component analysis; speaker adaptation;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2005.845819
Filename :
1420374
Link To Document :
بازگشت