مرکز منطقه ای اطلاع رساني علوم و فناوري - Melody Extraction and Musical Onset Detection via Probabilistic Models of Framewise STFT Peak Data

DocumentCode :

780185

Title :

Melody Extraction and Musical Onset Detection via Probabilistic Models of Framewise STFT Peak Data

Author :

Thornburg, Harvey ; Leistikow, Randal J. ; Berger, Jonathan

Author_Institution :

Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ

Volume :

Issue :

fYear :

2007

fDate :

5/1/2007 12:00:00 AM

Firstpage :

1257

Lastpage :

1272

Abstract :

We propose a probabilistic method for the joint segmentation and melody extraction for musical audio signals which arise from a monophonic score. The method operates on framewise short-time Fourier transform (STFT) peaks, enabling a computationally efficient inference of note onset, duration, and pitch attributes while retaining sufficient information for pitch determination and spectral change detection. The system explicitly models note events in terms of transient and steady-state regions as well as possible gaps between note events. In this way, the system readily distinguishes abrupt spectral changes associated with musical onsets from other abrupt change events. Additionally, the method may incorporate melodic context by modeling note-to-note dependences. The method is successfully applied to a variety of piano and violin recordings containing reverberation, effective polyphony due to legato playing style, expressive pitch variations, and background voices. While the method does not provide a sample-accurate segmentation, it facilitates the latter in subsequent processing by isolating musical onsets to frame neighborhoods and identifying possible pitch content before and after the true onset sample location

Keywords :

Fourier transforms; acoustic signal detection; acoustic signal processing; audio acoustics; audio recording; audio signal processing; music; musical instruments; probability; framewise STFT peak data; melody extraction; musical audio signals; musical onset detection; piano recordings; pitch determination; probabilistic models; short-time Fourier transform; signals segmentation; spectral change detection; violin recordings; Acoustics; Bayesian methods; Cities and towns; Context modeling; Data mining; Fourier transforms; Music; Reverberation; Robustness; Steady-state; Dynamic Bayesian networks; music transcription; onset detection; pitch identification;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.889801

Filename :

4156219

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=780185