• DocumentCode
    395224
  • Title
    Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech
  • Author
    García-Romero, D. ; Fiérrez-Aguilar, J. ; González-Rodríguez, J. ; Ortega-García, J.
  • Author_Institution
    Univ. Politecnica de Madrid, Spain
  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper proposes a support vector machine (SVM) based combining scheme that incorporates idiolectal and acoustic characteristics for speaker recognition. Two statistical modeling paradigms, Gaussian mixture models (GMMs) for acoustic modeling and bigrams for language modeling, provide multilevel speaker information that yields better classification performance when fused with an SVM. This combining approach is useful for all speaker recognition tasks where a considerable amount of data is available. Because no existing Spanish database made our experiments feasible, more than nine hours of Spanish conversational speech were collected and manually transcribed from broadcast radio talk shows.
  • Keywords
    Gaussian distribution; learning automata; sensor fusion; speaker recognition; GMM; SVM; SVM-based fusion; Spanish conversational speech; acoustic modeling; acoustic speaker information; bigrams; classification performance; idiolectal information; language modeling; multilevel speaker information; speaker recognition; statistical model; support vector machine; Acoustic signal processing; Forensics; Loudspeakers; NIST; Natural languages; Speaker recognition; Speech processing; Support vector machine classification; Support vector machines; System testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.2003.1202336
  • Filename
    1202336