Title :
Combination of fMLLR with clustering and fMLLR with MLLR clustering for rapid speaker adaptation
Author :
Jafari, Kasra ; Almasganj, Farshad ; Shekofteh, Yasser
Author_Institution :
Biomed. Eng. Dept., Amirkabir Univ. of Technol. (Tehran Polytech.), Tehran, Iran
Abstract :
Feature-space Maximum Likelihood Linear Regression (fMLLR) is known as an effective algorithm for rapid adaptation to a new speaker or environment. In this paper, we investigate combining feature-space transforms with speaker clustering to improve rapid speaker adaptation. fMLLR employs a single transformation matrix and a bias vector to transform the test speaker's features linearly. We applied fMLLR to less than 10 seconds of speech from Persian test speakers, which improved recognition by 1.5%. We then propose combining fMLLR with clustering; the results show that this method improves recognition by 2.5%. In another approach, we clustered the speakers and applied Maximum Likelihood Linear Regression (MLLR) to each cluster to improve that cluster's model, and then used fMLLR for rapid speaker adaptation; this yields a 2.25% improvement in speech recognition.
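
The linear feature transform described in the abstract can be illustrated with a minimal sketch. It only shows how an already-estimated matrix and bias would be applied to each feature frame (x' = A x + b). The function name `apply_fmllr` and the toy values of `A`, `b`, and `frames` are assumptions made for illustration and are not taken from the paper. Estimating `A` and `b` by maximum likelihood against the acoustic model is not shown.

```python
from typing import List, Sequence


def apply_fmllr(frames: Sequence[Sequence[float]],
                A: Sequence[Sequence[float]],
                b: Sequence[float]) -> List[List[float]]:
    """Apply the affine map x' = A x + b to every feature frame.

    Hypothetical helper: A (dim x dim) and b (dim) are assumed to have
    been estimated elsewhere from the speaker's adaptation data.
    """
    dim = len(b)
    if len(A) != dim or any(len(row) != dim for row in A):
        raise ValueError("A must be a square matrix matching the length of b")

    adapted = []
    for x in frames:
        if len(x) != dim:
            raise ValueError("every frame must have the same dimension as b")
        # One output coefficient per row of A, plus the matching bias term.
        adapted.append([
            sum(a_ij * x_j for a_ij, x_j in zip(row, x)) + b_i
            for row, b_i in zip(A, b)
        ])
    return adapted


# Toy 2-dimensional example with made-up numbers (not from the paper).
A = [[1.0, 0.5],
     [0.0, 2.0]]
b = [0.5, -1.0]
frames = [[1.0, 2.0],
          [0.0, 0.0]]

adapted = apply_fmllr(frames, A, b)
print(adapted)
```

Because a single matrix and bias are shared by all frames, only a small number of parameters must be estimated, which is consistent with the abstract's use of less than 10 seconds of adaptation speech.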
Keywords :
matrix algebra; maximum likelihood estimation; pattern clustering; regression analysis; speaker recognition; feature space maximum likelihood linear regression; feature space transforms; rapid speaker adaptation; speaker clustering; speaker recognition; transformation matrix; Adaptation model; Biomedical engineering; Clustering algorithms; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Space technology; Speech recognition; Target recognition; Testing; MLLR; clustering; fMLLR; speaker adaptation; speech recognition;
Conference_Title :
Electronic Computer Technology (ICECT), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7404-2
Electronic_ISBN :
978-1-4244-7406-6
DOI :
10.1109/ICECTECH.2010.5479971