DocumentCode :
3528222
Title :
Fishervoice and semi-supervised speaker clustering
Author :
Chu, Stephen M. ; Tang, Hao ; Huang, Thomas S.
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4089
Lastpage :
4092
Abstract :
Speaker subspace modeling has become increasingly important in speaker recognition, diarization, and clustering. Principal component analysis (PCA) is a popular linear subspace learning technique and the approach that represents an arbitrary utterance or speaker as a linear combination of a set of basis voices based on PCA is known as the eigenvoice approach. In this paper, a novel technique, namely the fishervoice approach, is proposed. The fishervoice approach is based on linear discriminant analysis, another successful linear subspace learning technique that provides an optimized low-dimensional representation of utterances or speakers with focus on the most discriminative basis voices. We apply the fishervoice approach to speaker clustering in a semi-supervised manner and show that the fishervoice approach significantly outperforms the eigenvoice approach in all our experiments on the GALE Mandarin dataset.
Keywords :
learning (artificial intelligence); natural language processing; pattern clustering; principal component analysis; speaker recognition; GALE Mandarin dataset; fishervoice clustering; linear discriminant analysis; linear subspace learning technique; principal component analysis; semi-supervised speaker clustering; speaker diarization; speaker recognition; speaker subspace modeling; Face recognition; Linear discriminant analysis; Manifolds; Pattern recognition; Principal component analysis; Probability distribution; Scattering; Speaker recognition; Speech; Support vector machines; Linear subspace learning; eigenvoice; fishervoice; semi-supervised speaker clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960527
Filename :
4960527
Link To Document :
بازگشت