Regional Information Center for Science and Technology - Probabilistic model of two-dimensional rhythm tree structure representation for automatic transcription of polyphonic MIDI signals

DocumentCode :

661446

Title :

Probabilistic model of two-dimensional rhythm tree structure representation for automatic transcription of polyphonic MIDI signals

Author :

Tsuchiya, Masahiro ; Ochiai, K. ; Kameoka, Hirokazu ; Sagayama, Shigeki

Author_Institution :

Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Hongo, Japan

fYear :

2013

fDate :

Oct. 29 2013-Nov. 1 2013

Firstpage :

Lastpage :

Abstract :

This paper proposes a Bayesian approach to automatic music transcription of polyphonic MIDI signals based on generative modeling of the onset occurrences of musical notes. Automatic music transcription involves two interdependent subproblems: rhythm recognition and tempo estimation. When we listen to music, we can recognize its rhythm and tempo (or beat location) fairly easily, even though the individual note values and the tempo are ambiguous. This is presumably made possible by our empirical knowledge of the rhythm patterns and tempo variations that are likely to occur in music. To automate the recognition of rhythm and tempo, we propose modeling the generative process of a MIDI signal of polyphonic music by combining two sub-processes: one that generates a musically natural tempo curve, and one that generates a set of note onset positions based on a 2-dimensional rhythm tree structure representation of music. We also develop a parameter inference algorithm for the proposed model and show some of the transcription results obtained with the present method.

Keywords :

Bayes methods; music; pattern recognition; signal representation; tree data structures; 2-dimensional rhythm tree structure representation; Bayesian approach; automatic music transcription; beat location; generative onset occurrence modeling; generative process modeling; musical notes; musically natural tempo curve; note onset positions; parameter inference algorithm; polyphonic MIDI signals; rhythm patterns; rhythm recognition; tempo estimation; tempo variations; Estimation; Hidden Markov models; Multiple signal classification; Production; Rhythm; Speech recognition; Vocabulary;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific

Conference_Location :

Kaohsiung

Type :

conf

DOI :

10.1109/APSIPA.2013.6694308

Filename :

6694308

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=661446