Title :
Acoustic segment modeling for speaker recognition
Author :
Ma, Bin ; Zhu, Donglai ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
fDate :
June 28 2009-July 3 2009
Abstract :
We propose a speaker recognition system based on the acoustic segment modeling technique. It is assumed that the overall sound characteristics for speakers can be covered by a set of acoustic segment models (ASMs) while the ASMs are acoustically-motivated self-organized sound units without imposing any phonetic definitions. These acoustic segment models decode a spoken utterance into a string of segment units and the mean vectors of ASMs based on the unsupervised MAP adaptation are concatenated to represent the characteristics of the specific speaker. Support vector machines are thus applied on these high dimensional feature vectors for speaker recognition. We evaluate the proposed approach in the 2006 NIST speaker recognition evaluation core condition test trials.
Keywords :
acoustic signal processing; decoding; maximum likelihood estimation; speaker recognition; speech coding; support vector machines; unsupervised learning; acoustic segment modeling; high dimensional feature vector; sound characteristics; speaker recognition system; spoken utterance decoding; support vector machine; unsupervised MAP adaptation; Concatenated codes; Hidden Markov models; Kernel; Loudspeakers; Maximum likelihood linear regression; Natural languages; Speaker recognition; Speech; Support vector machine classification; Support vector machines; Acoustic segment model; supervector;
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2009.5202841