DocumentCode
3123911
Title
Discriminant local information distance preserving projection for text-independent speaker recognition
Author
Liang He ; Jia Li
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2012
fDate
5-8 Dec. 2012
Firstpage
349
Lastpage
352
Abstract
A novel method is presented based on a statistical manifold for text-independent speaker recognition. After feature extraction, speaker recognition becomes a sequence classification problem. By discarding time information, the core task is the comparison of multiple sample sets. Each set is assumed to be governed by a probability density function (PDF). We estimate the PDFs and place the estimated statistical models on a statistical manifold. Fisher information distance is applied to compute distance between adjacent PDFs. Discriminant local preserving projection is used to push adjacent PDFs which belong to different classes apart to further improve the recognition accuracy. Experiments were carried out on the NIST SRE08 tel-tel database. Our presented method gave an excellent performance.
Keywords
feature extraction; probability; signal classification; speaker recognition; statistical analysis; Fisher information distance; NIST SRE08 tel-tel database; PDF; discriminant local information distance preserving projection; feature extraction; probability density function; sequence classification problem; statistical manifold; text-independent speaker recognition; Cepstral analysis; Databases; Manifolds; NIST; Probability density function; Speaker recognition; Vectors; Fisher information; discriminant local preserving projection; information geometry; text-independent speaker recognition; total variability model;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location
Kowloon
Print_ISBN
978-1-4673-2506-6
Electronic_ISBN
978-1-4673-2505-9
Type
conf
DOI
10.1109/ISCSLP.2012.6423466
Filename
6423466
Link To Document