DocumentCode :
698736
Title :
Melody spotting in raw audio recordings using Variable Duration Hidden Markov Models
Author :
Pikrakis, Aggelos ; Theodoridis, Sergios
Author_Institution :
Dept. of Inf. & Telecommun., Univ. of Athens, Athens, Greece
fYear :
2005
fDate :
4-8 Sept. 2005
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents a melody spotting system based on Variable Duration Hidden Markov Models (VDHMM´s), capable of locating monophonic melodies in a database of raw audio recordings. The raw audio recordings may either contain a single instrument performing in solo mode, or an ensemble of instruments where one of the instruments has a leading role. The melody to be spotted is treated as a pattern and is first converted into a sequence of note durations and music intervals. Based on this representation, a VDHMM is constructed. For each raw audio recording in the database, a sequence of note durations and music intervals is extracted by means of a multipitch tracking algorithm. These sequences are subsequently fed as input to the VDHMM that models the melody to be located. The VDHMM employs an enhanced Viterbi algorithm, previously introduced by the authors, in order to account for pitch tracking errors and performance improvisations of the instrument players. It then suffices to post-process the best-state sequence generated by the enhanced Viterbi algorithm in order to locate occurrences of the melody in question. Our method has been successfully tested with a variety of cello recordings in the context of Western Classical music, as well as with Greek traditional multi-instrument recordings where clarinet has a leading role.
Keywords :
audio databases; audio recording; audio signal processing; hidden Markov models; musical instruments; Greek traditional multiinstrument recordings; VDHMM; Viterbi algorithm; cello recordings; clarinet; instrument players; melody spotting system; monophonic melodies; multipitch tracking algorithm; music intervals; pitch tracking errors; raw audio recordings; variable duration hidden Markov models; western classical music; Audio recording; Equations; Hidden Markov models; Instruments; Mathematical model; Standards; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1
Type :
conf
Filename :
7078330
Link To Document :
بازگشت