Title :
Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio
Author_Institution :
Dept. of Eng., Cambridge Univ.
Abstract :
Automatic speaker segmentation and clustering methods have improved considerably over the last few years in the broadcast news domain. However, these generally still produce locally consistent relative labels (such as spkr1, spkr2) rather than true speaker identities (such as Bill Clinton, Ted Koppel). This paper presents a system which attempts to find these true identities from the text transcription of the audio using lexical pattern matching, and shows the effect on performance when using state-of-the-art speaker clustering and speech-to-text transcription systems instead of manual references
Keywords :
audio signal processing; pattern clustering; speech processing; automatic speaker segmentation; broadcast news audio; broadcast news domain; clustering methods; lexical pattern matching; speech-to-text transcription systems; state-of-the-art speaker clustering; Audio databases; Availability; Broadcasting; Clustering methods; Humans; Indexing; Information retrieval; Pattern matching; Speech processing; Testing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660195