• DocumentCode
    3269006
  • Title

    Exploiting speaker segmentations for automatic role detection. An application to broadcast news documents

  • Author

    Bigot, Benjamin ; Ferrane, Isabelle ; Pinquier, Julien

  • Author_Institution
    IRIT, Univ. de Toulouse, Toulouse, France
  • fYear
    2010
  • fDate
    23-25 June 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In the field of automatic audiovisual content-based indexing and structuring, finding events like interviews, debates, reports, or live commentaries requires to bridge the gap between low-level feature extraction and such high-level event detection. In our work, we consider that detecting speaker role to enrich interaction sequences between speakers is a first step to reach this goal. The generic method we propose follows a data mining approach. We assume that speaker roles can emerge from parameters extracted from speaker segmentations without taking any prior information into account. Each speaker is then represented by a feature vector carrying temporal, signal and prosodic information. In this paper, we study how methods for dimensionality reduction and classification can help to recognize speaker roles. This method is applied to the corpus of the ESTER2 evaluation campaign and our best result reaches about 72% of well recognized roles that corresponds to nearly 79% of speech time.
  • Keywords
    data mining; document handling; feature extraction; speaker recognition; audiovisual content-based indexing; automatic role detection; broadcast news documents; data mining approach; dimensionality classification method; dimensionality reduction method; feature vector; high-level event detection; low-level feature extraction; speaker segmentation; Bridges; Broadcasting; Content based retrieval; Data mining; Event detection; Feature extraction; Indexing; Information retrieval; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2010 International Workshop on
  • Conference_Location
    Grenoble
  • ISSN
    1949-3983
  • Print_ISBN
    978-1-4244-8028-9
  • Electronic_ISBN
    1949-3983
  • Type

    conf

  • DOI
    10.1109/CBMI.2010.5529900
  • Filename
    5529900