DocumentCode :
454703
Title :
Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio
Author :
Tranter, S.E.
Author_Institution :
Dept. of Eng., Cambridge Univ.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
Automatic speaker segmentation and clustering methods have improved considerably over the last few years in the broadcast news domain. However, these generally still produce locally consistent relative labels (such as spkr1, spkr2) rather than true speaker identities (such as Bill Clinton, Ted Koppel). This paper presents a system which attempts to find these true identities from the text transcription of the audio using lexical pattern matching, and shows the effect on performance when using state-of-the-art speaker clustering and speech-to-text transcription systems instead of manual references.
Keywords :
audio signal processing; pattern clustering; speech processing; automatic speaker segmentation; broadcast news audio; broadcast news domain; clustering methods; lexical pattern matching; speech-to-text transcription systems; state-of-the-art speaker clustering; Audio databases; Availability; Broadcasting; Clustering methods; Humans; Indexing; Information retrieval; Pattern matching; Speech processing; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660195
Filename :
1660195