DocumentCode :
1440929
Title :
Vocal Melody Extraction in the Presence of Pitched Accompaniment in Polyphonic Music
Author :
Rao, Vishweshwara ; Rao, Preeti
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
Volume :
18
Issue :
8
fYear :
2010
Firstpage :
2145
Lastpage :
2154
Abstract :
Melody extraction algorithms for single-channel polyphonic music typically rely on the salience of the lead melodic instrument, considered here to be the singing voice. However, the simultaneous presence of one or more pitched instruments in the polyphony can cause such a predominant-F0 tracker to switch between tracking the pitch of the voice and that of an instrument of comparable strength, resulting in reduced voice-pitch detection accuracy. We propose a system that, in addition to biasing the salience measure in favor of singing voice characteristics, acknowledges that the voice may not dominate the polyphony at all instants and therefore tracks an additional pitch to better deal with the potential presence of locally dominant pitched accompaniment. A feature based on the temporal instability of voice harmonics is used to finally identify the voice pitch. The proposed system is evaluated on test data that is representative of polyphonic music with strong pitched accompaniment. Results show that the proposed system is indeed able to recover melodic information lost to its single-pitch tracking counterpart, and also outperforms another state-of-the-art melody extraction system designed for polyphonic music.
Keywords :
music; speech processing; lead melodic instrument; melody extraction algorithms; pitched accompaniment; singing voice; single-channel polyphonic music; single-pitch tracking counterpart; vocal melody extraction; voice-pitch detection; Automatic control; Data mining; Frequency estimation; Humans; Instruments; Music information retrieval; Robustness; Signal representations; Switches; System testing; Fundamental frequency estimation; music information retrieval (MIR); music transcription; predominant pitch detection
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2042124
Filename :
5431024