DocumentCode :
2233734
Title :
Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm
Author :
Chuan, Ching-Hua ; Chew, Elaine
Author_Institution :
Department of Computer Science, University of Southern California Viterbi School of Engineering, Los Angeles, CA, chinghuc@usc.edu
fYear :
2005
fDate :
6-8 July 2005
Firstpage :
21
Lastpage :
24
Abstract :
Key finding is an integral step in content-based music indexing and retrieval. In this paper, we present an O(n) real-time algorithm for determining key from polyphonic audio. We use the standard Fast Fourier Transform with a local maximum detection scheme to extract pitches and pitch strengths from polyphonic audio. Next, we use Chew´s Spiral Array Center of Effect Generator (CEG) algorithm to determine the key from pitch strength information. We test the proposed system using Mozart´s Symphonies. The test data is audio generated from MIDI source. The algorithm achieves a maximum correct key recognition rate of 96% within the first fifteen seconds, and exceeds 90% within the first three seconds. Starting from the extracted pitch strength information, we compare the CEG algorithm´s performance to the classic Krumhansl-Schmuckler (K-S) probe tone profile method and Temperley´s modified version of the K-S method. Correct key recognition rates for the K-S and modified K-S methods remain under 50% in the first three seconds, with maximum values of 80% and 87% respectively within the first fifteen seconds for the same test set. The CEG method consistently scores higher throughout the fifteen-second selections.
Keywords :
Computer science; Content based retrieval; Data mining; Fast Fourier transforms; Indexing; Music information retrieval; Probes; Spirals; Testing; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Print_ISBN :
0-7803-9331-7
Type :
conf
DOI :
10.1109/ICME.2005.1521350
Filename :
1521350
Link To Document :
بازگشت