DocumentCode :
3500561
Title :
Hidden Markov model for automatic transcription of MIDI signals
Author :
Takeda, Haruto ; Saito, Naoki ; Otsuki, Tomoshi ; Nakai, Mitsuru ; Shimodaira, Hiroshi ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci. & Tech., Tokyo Univ., Japan
fYear :
2002
fDate :
9-11 Dec. 2002
Firstpage :
428
Lastpage :
431
Abstract :
This paper describes a Hidden Markov Model (HMM)-based method of automatic transcription of MIDI (Musical Instrument Digital Interface) signals of performed music. The problem is formulated as recognition of a given sequence of fluctuating note durations to find the most likely intended note sequence utilizing the modern continuous speech recognition technique. Combining a stochastic model of deviating note durations and a stochastic grammar representing possible sequences of notes, the maximum likelihood estimate of the note sequence is searched in terms of Viterbi algorithm. The same principle is successfully applied to a joint problem of bar line allocation, time measure recognition, and tempo estimation. Finally, durations of consecutive ηn notes are combined to form a "rhythm vector" representing tempo-free relative durations of the notes and treated in the same framework. Significant improvements compared with conventional "quantization" techniques are shown.
Keywords :
audio signal processing; electronic music; hidden Markov models; maximum likelihood estimation; pattern recognition; stochastic processes; HMM; MIDI signals; Viterbi algorithm; bar line allocation; conventional quantization techniques; deviating note durations; duration recognition; fluctuating note sequence; hidden Markov model based automatic transcription; maximum likelihood estimate; modern continuous speech recognition technique; musical instrument digital interface; performed music; rhythm vector; stochastic grammar; stochastic model; tempo estimation; tempo free relative durations; time measure recognition; Automatic speech recognition; Hidden Markov models; Instruments; Maximum likelihood estimation; Multiple signal classification; Rhythm; Speech recognition; Stochastic processes; Time measurement; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 2002 IEEE Workshop on
Print_ISBN :
0-7803-7713-3
Type :
conf
DOI :
10.1109/MMSP.2002.1203337
Filename :
1203337
Link To Document :
بازگشت