DocumentCode :
2520911
Title :
Polyphonic music transcription using note event modeling
Author :
Ryynänen, Matti P. ; Klapuri, Anssi
Author_Institution :
Inst. of Signal Process., Tampere Univ. of Technol., Finland
fYear :
2005
fDate :
16-19 Oct. 2005
Firstpage :
319
Lastpage :
322
Abstract :
This paper proposes a method for the automatic transcription of real-world music signals, including a variety of musical genres. The method transcribes notes played with pitched musical instruments. Percussive sounds, such as drums, may be present but they are not transcribed. Musical notations (i.e., MIDI files) are produced from acoustic stereo input files using probabilistic note event modeling. Note events are described with a hidden Markov model (HMM). The model uses three acoustic features extracted with a multiple fundamental frequency (FO) estimator to calculate the likelihoods of different notes and performs temporal segmentation of notes. The transitions between notes are controlled with a musicological model involving musical key estimation and bigram models. The final transcription is obtained by searching for several paths through the note models. Evaluation was carried out with a realistic music database. Using strict evaluation criteria, 39% of all the notes were found (recall) and 41% of the transcribed notes were correct (precision). Taken the complexity of the considered transcription task, the results are encouraging.
Keywords :
acoustic signal processing; feature extraction; frequency estimation; hidden Markov models; music; musical instruments; HMM; acoustic features extraction; bigram models; hidden Markov model; multiple fundamental frequency estimator; music database; musical key estimation; musical notations; note event modeling; percussive sounds; pitched musical instruments; polyphonic music transcription; real-world music signals; Audio recording; Computer applications; Databases; Feature extraction; Frequency estimation; Hidden Markov models; Instruments; Multiple signal classification; Music; Signal processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2005. IEEE Workshop on
Print_ISBN :
0-7803-9154-3
Type :
conf
DOI :
10.1109/ASPAA.2005.1540233
Filename :
1540233
Link To Document :
بازگشت