• DocumentCode
    2964304
  • Title

    Topic-based speaker recognition for German parliamentary speeches

  • Author

    Baum, Doris

  • Author_Institution
    Dept. NetMedia, Fraunhofer IAIS, St. Augustin, Germany
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    427
  • Lastpage
    431
  • Abstract
    In the last decade, high-level features for speaker recognition have become a research focus, as they are believed to alleviate the weak point of the classical spectral/cepstral-feature-based approaches: mismatch in acoustic conditions or channel between training and test data. Identification cues such as prosody, pronunciation, and idiolect have been successfully investigated. Semantic speaker recognition, such as identifying people by the topics they frequently talk about, has not found an equal amount of attention. However, it is a promising approach, especially for broadcast data and multimedia archives, where prominent speakers can be expected to often talk about their specific subjects. This paper reports on our experiments with topic-based speaker recognition on German parliamentary speeches. Text transcripts of speeches of federal ministers were used to train speaker models based on word frequencies. For recognition, these models were applied to automatic speech recognition transcripts of parliamentary speeches and could identify the correct speaker surprisingly well, with an EER of 13.8%. Fusing this approach with a classical GMM-UBM system (with EER 14.3%) yields an improved EER of 8.6%.
  • Keywords
    speaker recognition; German parliamentary speeches; broadcast data; classical spectral-cepstral-feature-based approaches; idiolect; multimedia archives; pronunciation; prosody; semantic speaker recognition; text transcripts; topic-based speaker recognition; Acoustic testing; Automatic speech recognition; Cepstral analysis; Digital multimedia broadcasting; Frequency; Information retrieval; Loudspeakers; Multimedia communication; Speaker recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5372907
  • Filename
    5372907