DocumentCode :
2018508
Title :
Supervised and unsupervised clustering of the speaker space for connectionist speech recognition
Author :
Konig, Yochai ; Morgan, Nelson
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
Volume :
1
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
545
Abstract :
One of the challenging problems of a speaker-independent continuous speech recognition system is how to achieve good performance with a new speaker, when the only available source of information about the new speaker is the utterance to be recognized. The authors propose a first step toward a solution, based on clustering of the speaker space. The study had two steps. The first was searching for a set of features to cluster speakers. Second, using the chosen features, two kinds of clustering were investigated: supervised-using two clusters, males and females-and unsupervised-using two, three, and five clusters. The cluster information was integrated into the connectionist speech recognition system by using the speaker cluster neural network (SCNN). The SCNN attempts to share the speaker-independent parameters and to model the cluster-dependent parameters. The results show that the best performance is achieved with the supervised clusters, resulting in an overall improvement in recognition performance.<>
Keywords :
neural nets; search problems; speech recognition; connectionist speech recognition; performance; speaker cluster neural network; supervised clustering; unsupervised clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319176
Filename :
319176
Link To Document :
بازگشت