DocumentCode
454703
Title
Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio
Author
Trantee, S.E.
Author_Institution
Dept. of Eng., Cambridge Univ.
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
Automatic speaker segmentation and clustering methods have improved considerably over the last few years in the broadcast news domain. However, these generally still produce locally consistent relative labels (such as spkr1, spkr2) rather than true speaker identities (such as Bill Clinton, Ted Koppel). This paper presents a system which attempts to find these true identities from the text transcription of the audio using lexical pattern matching, and shows the effect on performance when using state-of-the-art speaker clustering and speech-to-text transcription systems instead of manual references
Keywords
audio signal processing; pattern clustering; speech processing; automatic speaker segmentation; broadcast news audio; broadcast news domain; clustering methods; lexical pattern matching; speech-to-text transcription systems; state-of-the-art speaker clustering; Audio databases; Availability; Broadcasting; Clustering methods; Humans; Indexing; Information retrieval; Pattern matching; Speech processing; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660195
Filename
1660195
Link To Document