DocumentCode
698736
Title
Melody spotting in raw audio recordings using Variable Duration Hidden Markov Models
Author
Pikrakis, Aggelos ; Theodoridis, Sergios
Author_Institution
Dept. of Inf. & Telecommun., Univ. of Athens, Athens, Greece
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
This paper presents a melody spotting system based on Variable Duration Hidden Markov Models (VDHMM´s), capable of locating monophonic melodies in a database of raw audio recordings. The raw audio recordings may either contain a single instrument performing in solo mode, or an ensemble of instruments where one of the instruments has a leading role. The melody to be spotted is treated as a pattern and is first converted into a sequence of note durations and music intervals. Based on this representation, a VDHMM is constructed. For each raw audio recording in the database, a sequence of note durations and music intervals is extracted by means of a multipitch tracking algorithm. These sequences are subsequently fed as input to the VDHMM that models the melody to be located. The VDHMM employs an enhanced Viterbi algorithm, previously introduced by the authors, in order to account for pitch tracking errors and performance improvisations of the instrument players. It then suffices to post-process the best-state sequence generated by the enhanced Viterbi algorithm in order to locate occurrences of the melody in question. Our method has been successfully tested with a variety of cello recordings in the context of Western Classical music, as well as with Greek traditional multi-instrument recordings where clarinet has a leading role.
Keywords
audio databases; audio recording; audio signal processing; hidden Markov models; musical instruments; Greek traditional multiinstrument recordings; VDHMM; Viterbi algorithm; cello recordings; clarinet; instrument players; melody spotting system; monophonic melodies; multipitch tracking algorithm; music intervals; pitch tracking errors; raw audio recordings; variable duration hidden Markov models; western classical music; Audio recording; Equations; Hidden Markov models; Instruments; Mathematical model; Standards; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078330
Link To Document