• DocumentCode
    2875557
  • Title

    A comparative study using manual and automatic transcriptions for diarization

  • Author

    Canseco, Leonardo ; Lamel, Lori ; Gauvain, Jean-Luc

  • Author_Institution
    Spoken Language Process. Group, LIMSI-CNRS, Orsay
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    415
  • Lastpage
    419
  • Abstract
    This paper describes recent studies on speaker diarization from automatic broadcast news transcripts. Linguistic information revealing the true names of who speaks during a broadcast (the next, the previous and the current speaker) is detected by means of linguistic patterns. In order to associate the true speaker names with the speech segments, a set of rules are defined for each pattern. Since the effectiveness of linguistic patterns for diarization depends on the quality of the transcription, the performance using automatic transcripts generated with an LVCSR system are compared with those obtained using manual transcriptions. On about 150 hours of broadcast news data (295 shows) the global ratio of false identity association is about 13% for the automatic and the manual transcripts
  • Keywords
    natural languages; speaker recognition; automatic transcriptions; broadcast news transcripts; linguistic patterns; manual transcriptions; speaker diarization; speaker identity; Audio recording; Broadcasting; Dictionaries; IEEE news; Information resources; Information retrieval; Loudspeakers; Natural languages; Speech; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566507
  • Filename
    1566507