Title :
Non-negative matrix factorization based discriminative features for speaker verification
Author :
Long, Yan-hua ; Dai, Li-Rong ; Wang, Er-yu ; Ma, Bin ; Guo, Wu
Author_Institution :
iFly Speech Lab., Univ. of Sci. & Technol. of China (USTC), Hefei, China
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
Discovering a discriminative feature representative together with a suitable distance measure is the key for a successful speaker recognition system. In this paper, we propose a new approach for automatic speaker verification. The main contribution of the paper is the extraction of discriminative speaker features using non-negative matrix factorization (NMF) decomposition in the GMM mean space, and the use of cosine-distance measure for speaker classification. With the decomposition, the speaker space is represented by the pattern components while a speaker can be characterized by a coefficient vector representing a specific localization in the space. We validate the proposed approach on the 10-second training and 10-second testing condition constructed from 863 Putonghua (Mandarin) corpus. Relative 10.57% and 26.11% improvements compared to the conventional GMM-UBM system have been achieved for female and male trials respectively.
Keywords :
distance measurement; feature extraction; matrix decomposition; pattern classification; speaker recognition; GMM mean space; GMM-UBM system; automatic speaker verification; cosine distance measure; discriminative feature; nonnegative matrix factorization; speaker classification; speaker recognition system; speaker space; Feature extraction; Matrix decomposition; Speaker recognition; Speech; Speech processing; Training; Vectors; NMF; cosine-distance measure; representative feature; speaker verification;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684891