DocumentCode
699492
Title
Fusion of descriptors for speech / music classification
Author
Mauclair, Julie ; Pinquier, Julien
Author_Institution
Lab. d´Inf., Univ. du Maine, Le Mans, France
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
1285
Lastpage
1288
Abstract
This work addresses the soundtrack indexing of multimedia documents. We present a speech/music classification system based on three original features: entropy modulation, stationary segment duration and number of segments. They were merged by basic score maximisation with the classical 4 Hertz modulation energy. We validate this fusion approach with the use of the probability theory and the evidence theory. The system is tested on radio corpora. Systems are simple, robust and could be improved on every corpus without training or adaptation.
Keywords
document handling; entropy; indexing; inference mechanisms; multimedia computing; music; probability; sensor fusion; signal classification; speech processing; basic score maximisation; descriptor fusion; entropy modulation; evidence theory; modulation energy; multimedia documents; music classification system; number-of-segments; probability theory; radio corpora; soundtrack indexing; speech classification system; stationary segment duration; Abstracts; Entropy; Filtering theory; Reliability; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7080022
Link To Document