Title :
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech
Author :
García-Romero, D. ; Fiérrez-Aguilar, J. ; González-Rodríguez, J. ; Ortega-Garcia, J.
Author_Institution :
Speech & Signal Process. Group, Univ. Politecnica de Madrid, Spain
Abstract :
This paper proposes a support vector machine (SVM) based combining scheme that incorporates idiolectal and acoustic characteristics for speaker recognition. Two statistical modeling paradigms, namely Gaussian mixture models (GMM) for acoustic modeling and bigrams for language modeling, provide multilevel speaker information that yields better classification performance when their outputs are fused with an SVM. This combining approach is useful for any speaker recognition task where a considerable amount of data is available. Because no existing Spanish database made our experiments feasible, more than nine hours of Spanish conversational speech were collected and manually transcribed from broadcast radio talk shows.
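The fusion idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that each trial yields one acoustic (GMM) score and one idiolectal (bigram) score, that the two are stacked into a 2-D vector, and that a linear SVM is trained on those vectors. The kernel choice, the training procedure (batch subgradient descent on the hinge loss), the function names, and all score values are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: score-level fusion with a linear SVM.
# Every number below is synthetic; nothing here is taken from the paper.

def train_linear_svm(samples, labels, lam=0.01, epochs=200, lr=0.1):
    """Fit w, b by batch subgradient descent on the L2-regularised hinge loss.

    labels must be +1 (target speaker) or -1 (impostor).
    """
    dim = len(samples[0])
    n = len(samples)
    w = [0.0] * dim
    b = 0.0
    for _ in range(epochs):
        # Gradient of the regulariser (lam / 2) * ||w||^2.
        grad_w = [lam * wi for wi in w]
        grad_b = 0.0
        for x, y in zip(samples, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            if margin < 1.0:
                # Only margin violators contribute to the hinge subgradient.
                for i in range(dim):
                    grad_w[i] -= y * x[i] / n
                grad_b -= y / n
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def fused_score(w, b, x):
    """Signed distance-like score; positive means 'accept as target'."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


# Hypothetical (acoustic_score, idiolectal_score) pairs for training trials.
train_x = [(2.0, 1.5), (1.5, 0.5), (0.5, 2.0), (1.8, 1.9),       # target trials
           (-1.0, -0.5), (-2.0, 0.3), (0.2, -1.8), (-1.5, -1.2)]  # impostor trials
train_y = [1, 1, 1, 1, -1, -1, -1, -1]

w, b = train_linear_svm(train_x, train_y)
decisions = [1 if fused_score(w, b, x) > 0 else -1 for x in train_x]
```

In this toy data, some trials have one weak or misleading score (for example an impostor trial with a slightly positive idiolectal score), and the learned boundary weighs both scores jointly rather than relying on either one alone. A real system would normalise the scores and evaluate on held-out trials.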
Keywords :
Gaussian processes; audio databases; natural languages; speaker recognition; statistical analysis; support vector machines; Gaussian mixture model; SVM; Spanish conversational speech; Spanish databases; acoustic speaker information; bigrams; idiolectal support vector machine fusion; language modeling; speaker recognition; statistical model paradigms; Acoustic signal processing; Forensics; Loudspeakers; NIST; Natural languages; Speaker recognition; Speech processing; Support vector machine classification; Support vector machines; System testing;
Conference_Title :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221284