Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio

Author

Trantee, S.E.

Author_Institution

Dept. of Eng., Cambridge Univ.

Volume

1

fYear

2006

fDate

14-19 May 2006

Abstract

Automatic speaker segmentation and clustering methods have improved considerably over the last few years in the broadcast news domain. However, these generally still produce locally consistent relative labels (such as spkr1, spkr2) rather than true speaker identities (such as Bill Clinton, Ted Koppel). This paper presents a system which attempts to find these true identities from the text transcription of the audio using lexical pattern matching, and shows the effect on performance when using state-of-the-art speaker clustering and speech-to-text transcription systems instead of manual references

Keywords

audio signal processing; pattern clustering; speech processing; automatic speaker segmentation; broadcast news audio; broadcast news domain; clustering methods; lexical pattern matching; speech-to-text transcription systems; state-of-the-art speaker clustering; Audio databases; Availability; Broadcasting; Clustering methods; Humans; Indexing; Information retrieval; Pattern matching; Speech processing; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location

Toulouse

ISSN

1520-6149

Print_ISBN

1-4244-0469-X

Type

conf

DOI

10.1109/ICASSP.2006.1660195

Filename

1660195