Title :
A comparative study using manual and automatic transcriptions for diarization
Author :
Canseco, Leonardo ; Lamel, Lori ; Gauvain, Jean-Luc
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay
Abstract :
This paper describes recent studies on speaker diarization from automatic broadcast news transcripts. Linguistic information revealing the true names of who speaks during a broadcast (the next, the previous and the current speaker) is detected by means of linguistic patterns. In order to associate the true speaker names with the speech segments, a set of rules are defined for each pattern. Since the effectiveness of linguistic patterns for diarization depends on the quality of the transcription, the performance using automatic transcripts generated with an LVCSR system are compared with those obtained using manual transcriptions. On about 150 hours of broadcast news data (295 shows) the global ratio of false identity association is about 13% for the automatic and the manual transcripts
Keywords :
natural languages; speaker recognition; automatic transcriptions; broadcast news transcripts; linguistic patterns; manual transcriptions; speaker diarization; speaker identity; Audio recording; Broadcasting; Dictionaries; IEEE news; Information resources; Information retrieval; Loudspeakers; Natural languages; Speech; Vocabulary;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
DOI :
10.1109/ASRU.2005.1566507