مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker identification based on robust sparse coding with limited data

DocumentCode :

3447296

Title :

Speaker identification based on robust sparse coding with limited data

Author :

Taolin Wang ; Jian Cheng

Author_Institution :

Sch. of Electron. Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China

fYear :

2012

fDate :

16-18 Oct. 2012

Firstpage :

1611

Lastpage :

1614

Abstract :

The sparse representation classifier has achieved interesting classification results in face recognition. In speaker identification task, we intend to form an over complete dictionary using the GMM supervector for the training data. Then, the sparse representation is shaped as a sparsity-restricted robust regression problem. By supposing that the representation residuary and the representation coefficient are respectively independent, we use robust sparse coding (RSC) based on maximum likelihood estimation (MLE) solution to solve the sparse representation problem. In RSC, the collaborative representation strategy, taking the training utterances from all the extra classes as the nonlocal utterances of one class, is quite suitable for speaker recognition with limited data. Finally, experiments were carried out to evaluate the RSC on the ELSDSR database. The results have shown the performance of the proposed algorithm is much effective than the state-of-the-art methods of speaker identification.

Keywords :

Gaussian processes; maximum likelihood estimation; regression analysis; signal classification; signal representation; sparse matrices; speaker recognition; speech coding; ELSDSR database; GMM supervector; MLE solution; RSC; collaborative representation strategy; maximum likelihood estimation solution; nonlocal utterances; representation coefficient; representation residuary; robust sparse coding; sparse representation classifier; sparsity-restricted robust regression problem; speaker identification; speaker recognition; training data; training utterances; Adaptation models; Encoding; Maximum likelihood estimation; Robustness; Speech; Testing; Training; GMM supervector; limited data; robust sparse coding; speaker identification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Image and Signal Processing (CISP), 2012 5th International Congress on

Conference_Location :

Chongqing

Print_ISBN :

978-1-4673-0965-3

Type :

conf

DOI :

10.1109/CISP.2012.6469907

Filename :

6469907

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3447296