مرکز منطقه ای اطلاع رساني علوم و فناوري - A Fishervoice based feature fusion method for short utterance speaker recognition

DocumentCode :

3430617

Title :

A Fishervoice based feature fusion method for short utterance speaker recognition

Author :

Chenhao Zhang ; Zheng, Thomas Fang

Author_Institution :

Div. of Tech. Innovation & Dev., Tsinghua Univ., Beijing, China

fYear :

2013

fDate :

6-10 July 2013

Firstpage :

165

Lastpage :

169

Abstract :

For GMM-UBM based text-independent speaker recognition, the performance decreases significantly when the utterance is getting too short, and that is mostly due to the lack of distinguishable information from a single kind of feature. Fusion of different features followed by a dimensionality reduction process has been proved useful to provide a satisfying solution. However, some fusion methods based on the traditional Linear Discriminant Analysis (LDA) may cause the singular matrix problem. Therefore, a Fishervoice based feature fusion method incorporating with the Principal Component Analysis (PCA) and the LDA is proposed, where several features, such as MFCC, PLAR and LPCC, which are commonly used, are concatenated, and then projected into a lower-dimensional subspace. Compared with the baseline GMM-UBM systems using any single feature and using the LDA based fusion method, the proposed one can effectively reduce the equal error rate and give the best performance for text-independent speaker recognition for utterances as short as about 2 seconds.

Keywords :

matrix algebra; principal component analysis; sensor fusion; speaker recognition; Fishervoice based feature fusion method; GMM-UBM; LDA; PCA; linear discriminant analysis; principal component analysis; short utterance speaker recognition; singular matrix problem; text-independent speaker recognition; Feature extraction; Mel frequency cepstral coefficient; Principal component analysis; Speaker recognition; Speech; Speech recognition; Vectors; Feature fusion; Fishervoice; LDA; PCA; Short utterance speaker recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal and Information Processing (ChinaSIP), 2013 IEEE China Summit & International Conference on

Conference_Location :

Beijing

Type :

conf

DOI :

10.1109/ChinaSIP.2013.6625320

Filename :

6625320

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3430617