Title :
A temporal multi-view approach for audio key finding using adaboost
Author_Institution :
Univ. of North Florida, Jacksonville, FL, USA
Abstract :
Audio key finding is an integral step in content-based music indexing and retrieval. In this paper, we present a system that combines ensemble learning with an existing model-based key finding algorithm: the Fuzzy Analysis Center of Effect Generator algorithm. We demonstrate the manner in which AdaBoost improves the accuracy of FACEG using a dataset containing 2785 audio excerpts of real performances composed by Bach and Mozart. Two sets of experiments were conducted: intra-system comparison examining the effect of different settings in FACEG/AdaBoost, and inter-system comparison comparing FACEG/AdaBoost with the key finding implementation in Music Information Retrieval (MIR) toolbox. When FACEG is executed to generate keys at multiple stopping points of the excerpt, AdaBoost with multi-views of tonal information improves key detection accuracy up to 35% on the challenging dataset and up to 21% on the entire dataset.
Keywords :
content-based retrieval; indexing; learning (artificial intelligence); music; AdaBoost; MIR toolbox; audio key finding; content-based music indexing; content-based music retrieval; ensemble learning; fuzzy analysis center-of-effect generator algorithm; model-based key finding algorithm; temporal multiview approach; tonal information; Accuracy; Adaptation models; Algorithm design and analysis; Arrays; Decision trees; Hidden Markov models; Spirals; AdaBoost; Audio key finding; Center of Effect Generator; Fuzzy Analysis; Spiral Array;
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
DOI :
10.1109/ICMEW.2013.6618295