DocumentCode
2875557
Title
A comparative study using manual and automatic transcriptions for diarization
Author
Canseco, Leonardo ; Lamel, Lori ; Gauvain, Jean-Luc
Author_Institution
Spoken Language Process. Group, LIMSI-CNRS, Orsay
fYear
2005
fDate
27-27 Nov. 2005
Firstpage
415
Lastpage
419
Abstract
This paper describes recent studies on speaker diarization from automatic broadcast news transcripts. Linguistic information revealing the true names of who speaks during a broadcast (the next, the previous and the current speaker) is detected by means of linguistic patterns. In order to associate the true speaker names with the speech segments, a set of rules are defined for each pattern. Since the effectiveness of linguistic patterns for diarization depends on the quality of the transcription, the performance using automatic transcripts generated with an LVCSR system are compared with those obtained using manual transcriptions. On about 150 hours of broadcast news data (295 shows) the global ratio of false identity association is about 13% for the automatic and the manual transcripts
Keywords
natural languages; speaker recognition; automatic transcriptions; broadcast news transcripts; linguistic patterns; manual transcriptions; speaker diarization; speaker identity; Audio recording; Broadcasting; Dictionaries; IEEE news; Information resources; Information retrieval; Loudspeakers; Natural languages; Speech; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location
San Juan
Print_ISBN
0-7803-9478-X
Electronic_ISBN
0-7803-9479-8
Type
conf
DOI
10.1109/ASRU.2005.1566507
Filename
1566507
Link To Document