Topic-based speaker recognition for German parliamentary speeches

Author

Baum, Doris

Author_Institution

Dept. NetMedia, Fraunhofer IAIS, St. Augustin, Germany

fYear

2009

fDate

Nov. 13 2009-Dec. 17 2009

Firstpage

427

Lastpage

431

Abstract

In the last decade, high-level features for speaker recognition have become a research focus, as they are believed to alleviate the weak point of the classical spectral/cepstral-feature-based approaches: mismatch in acoustic conditions or channel between training and test data. Identification cues such as prosody, pronunciation, and idiolect have been successfully investigated. Semantic speaker recognition, such as identifying people by the topics they frequently talk about, has not found an equal amount of attention. However, it is a promising approach, especially for broadcast data and multimedia archives, where prominent speakers can be expected to often talk about their specific subjects. This paper reports on our experiments with topic-based speaker recognition on German parliamentary speeches. Text transcripts of speeches of federal ministers were used to train speaker models based on word frequencies. For recognition, these models were applied to automatic speech recognition transcripts of parliamentary speeches and could identify the correct speaker surprisingly well, with an EER of 13.8%. Fusing this approach with a classical GMM-UBM system (with EER 14.3%) yields an improved EER of 8.6%.

Keywords

speaker recognition; German parliamentary speeches; broadcast data; classical spectral-cepstral-feature-based approaches; idiolect; multimedia archives; pronunciation; prosody; semantic speaker recognition; text transcripts; topic-based speaker recognition; Acoustic testing; Automatic speech recognition; Cepstral analysis; Digital multimedia broadcasting; Frequency; Information retrieval; Loudspeakers; Multimedia communication; Speaker recognition; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on

Conference_Location

Merano

Print_ISBN

978-1-4244-5478-5

Electronic_ISBN

978-1-4244-5479-2

Type

conf

DOI

10.1109/ASRU.2009.5372907

Filename

5372907