• DocumentCode
    454703
  • Title

    Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio

  • Author

    Trantee, S.E.

  • Author_Institution
    Dept. of Eng., Cambridge Univ.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Automatic speaker segmentation and clustering methods have improved considerably over the last few years in the broadcast news domain. However, these generally still produce locally consistent relative labels (such as spkr1, spkr2) rather than true speaker identities (such as Bill Clinton, Ted Koppel). This paper presents a system which attempts to find these true identities from the text transcription of the audio using lexical pattern matching, and shows the effect on performance when using state-of-the-art speaker clustering and speech-to-text transcription systems instead of manual references
  • Keywords
    audio signal processing; pattern clustering; speech processing; automatic speaker segmentation; broadcast news audio; broadcast news domain; clustering methods; lexical pattern matching; speech-to-text transcription systems; state-of-the-art speaker clustering; Audio databases; Availability; Broadcasting; Clustering methods; Humans; Indexing; Information retrieval; Pattern matching; Speech processing; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660195
  • Filename
    1660195