• DocumentCode
    699492
  • Title
    Fusion of descriptors for speech / music classification
  • Author
    Mauclair, Julie ; Pinquier, Julien
  • Author_Institution
    Lab. d'Inf., Univ. du Maine, Le Mans, France
  • fYear
    2004
  • fDate
    6-10 Sept. 2004
  • Firstpage
    1285
  • Lastpage
    1288
  • Abstract
    This work addresses the soundtrack indexing of multimedia documents. We present a speech/music classification system based on three original features: entropy modulation, stationary segment duration and number of segments. They were merged by basic score maximisation with the classical 4 Hertz modulation energy. We validate this fusion approach using probability theory and evidence theory. The system is tested on radio corpora. The systems are simple and robust, and could be improved on every corpus without training or adaptation.
  • Keywords
    document handling; entropy; indexing; inference mechanisms; multimedia computing; music; probability; sensor fusion; signal classification; speech processing; basic score maximisation; descriptor fusion; entropy modulation; evidence theory; modulation energy; multimedia documents; music classification system; number-of-segments; probability theory; radio corpora; soundtrack indexing; speech classification system; stationary segment duration; Abstracts; Entropy; Filtering theory; Reliability; Speech;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference, 2004 12th European
  • Conference_Location
    Vienna
  • Print_ISBN
    978-320-0001-65-7
  • Type
    conf
  • Filename
    7080022