مرکز منطقه ای اطلاع رساني علوم و فناوري - Time Series Models for Semantic Music Annotation

DocumentCode :

1363937

Title :

Time Series Models for Semantic Music Annotation

Author :

Coviello, Emanuele ; Chan, Antoni B. ; Lanckriet, Gert

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of California at San Diego, La Jolla, CA, USA

Volume :

Issue :

fYear :

2011

fDate :

7/1/2011 12:00:00 AM

Firstpage :

1343

Lastpage :

1359

Abstract :

Many state-of-the-art systems for automatic music tagging model music based on bag-of-features representations which give little or no account of temporal dynamics, a key characteristic of the audio signal. We describe a novel approach to automatic music annotation and retrieval that captures temporal (e.g., rhythmical) aspects as well as timbral content. The proposed approach leverages a recently proposed song model that is based on a generative time series model of the musical content-the dynamic texture mixture (DTM) model-that treats fragments of audio as the output of a linear dynamical system. To model characteristic temporal dynamics and timbral content at the tag level, a novel, efficient, and hierarchical expectation-maximization (EM) algorithm for DTM (HEM-DTM) is used to summarize the common information shared by DTMs modeling individual songs associated with a tag. Experiments show learning the semantics of music benefits from modeling temporal dynamics.

Keywords :

audio signal processing; expectation-maximisation algorithm; information retrieval; music; time series; DTM model; HEM-DTM; automatic music annotation; automatic music retrieval; automatic music tagging model; bag-of-features representation; dynamic texture mixture; expectation-maximization algorithm; linear dynamical system; rhythmical aspect; semantic music annotation; temporal dynamics; timbral content; time series model; Clustering algorithms; Computational modeling; Feature extraction; Heuristic algorithms; Hidden Markov models; Semantics; Time series analysis; Audio annotation and retrieval; dynamic texture model; music information retrieval;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2090148

Filename :

5613150

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1363937