DocumentCode :
661446
Title :
Probabilistic model of two-dimensional rhythm tree structure representation for automatic transcription of polyphonic MIDI signals
Author :
Tsuchiya, Masahiro ; Ochiai, K. ; Kameoka, Hirokazu ; Sagayama, Shigeki
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Hongo, Japan
fYear :
2013
fDate :
Oct. 29 2013-Nov. 1 2013
Firstpage :
1
Lastpage :
6
Abstract :
This paper proposes a Bayesian approach for automatic music transcription of polyphonic MIDI signals based on generative modeling of onset occurrences of musical notes. Automatic music transcription involves two subproblems that are interdependent of each other: rhythm recognition and tempo estimation. When we listen to music, we are able to recognize its rhythm and tempo (or beat location) fairly easily even though there is ambiguity in determining the individual note values and tempo. This may be made possible through our empirical knowledge about rhythm patterns and tempo variations that possibly occur in music. To automate the process of recognizing the rhythm and tempo of music, we propose modeling the generative process of a MIDI signal of polyphonic music by combining the sub-process by which a musically natural tempo curve is generated and the sub-process by which a set of note onset positions is generated based on a 2-dimensional rhythm tree structure representation of music, and develop a parameter inference algorithm for the proposed model. We show some of the transcription results obtained with the present method.
Keywords :
Bayes methods; music; pattern recognition; signal representation; tree data structures; 2-dimensional rhythm tree structure representation; Bayesian approach; automatic music transcription; beat location; generative onset occurrence modeling; generative process modeling; musical notes; musically natural tempo curve; note onset positions; parameter inference algorithm; polyphonic MIDI signals; rhythm patterns; rhythm recognition; tempo estimation; tempo variations; Estimation; Hidden Markov models; Multiple signal classification; Production; Rhythm; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
Type :
conf
DOI :
10.1109/APSIPA.2013.6694308
Filename :
6694308
Link To Document :
بازگشت