DocumentCode
2936985
Title
Acoustic segment modeling for speaker recognition
Author
Ma, Bin ; Zhu, Donglai ; Li, Haizhou
Author_Institution
Inst. for Infocomm Res., Singapore, Singapore
fYear
2009
fDate
June 28 2009-July 3 2009
Firstpage
1668
Lastpage
1671
Abstract
We propose a speaker recognition system based on the acoustic segment modeling technique. It is assumed that the overall sound characteristics for speakers can be covered by a set of acoustic segment models (ASMs) while the ASMs are acoustically-motivated self-organized sound units without imposing any phonetic definitions. These acoustic segment models decode a spoken utterance into a string of segment units and the mean vectors of ASMs based on the unsupervised MAP adaptation are concatenated to represent the characteristics of the specific speaker. Support vector machines are thus applied on these high dimensional feature vectors for speaker recognition. We evaluate the proposed approach in the 2006 NIST speaker recognition evaluation core condition test trials.
Keywords
acoustic signal processing; decoding; maximum likelihood estimation; speaker recognition; speech coding; support vector machines; unsupervised learning; acoustic segment modeling; high dimensional feature vector; sound characteristics; speaker recognition system; spoken utterance decoding; support vector machine; unsupervised MAP adaptation; Concatenated codes; Hidden Markov models; Kernel; Loudspeakers; Maximum likelihood linear regression; Natural languages; Speaker recognition; Speech; Support vector machine classification; Support vector machines; Acoustic segment model; supervector;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location
New York, NY
ISSN
1945-7871
Print_ISBN
978-1-4244-4290-4
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2009.5202841
Filename
5202841
Link To Document