• DocumentCode
    3500561
  • Title

    Hidden Markov model for automatic transcription of MIDI signals

  • Author

    Takeda, Haruto ; Saito, Naoki ; Otsuki, Tomoshi ; Nakai, Mitsuru ; Shimodaira, Hiroshi ; Sagayama, Shigeki

  • Author_Institution
    Graduate Sch. of Inf. Sci. & Tech., Tokyo Univ., Japan
  • fYear
    2002
  • fDate
    9-11 Dec. 2002
  • Firstpage
    428
  • Lastpage
    431
  • Abstract
    This paper describes a Hidden Markov Model (HMM)-based method of automatic transcription of MIDI (Musical Instrument Digital Interface) signals of performed music. The problem is formulated as recognition of a given sequence of fluctuating note durations to find the most likely intended note sequence utilizing the modern continuous speech recognition technique. Combining a stochastic model of deviating note durations and a stochastic grammar representing possible sequences of notes, the maximum likelihood estimate of the note sequence is searched in terms of Viterbi algorithm. The same principle is successfully applied to a joint problem of bar line allocation, time measure recognition, and tempo estimation. Finally, durations of consecutive ηn notes are combined to form a "rhythm vector" representing tempo-free relative durations of the notes and treated in the same framework. Significant improvements compared with conventional "quantization" techniques are shown.
  • Keywords
    audio signal processing; electronic music; hidden Markov models; maximum likelihood estimation; pattern recognition; stochastic processes; HMM; MIDI signals; Viterbi algorithm; bar line allocation; conventional quantization techniques; deviating note durations; duration recognition; fluctuating note sequence; hidden Markov model based automatic transcription; maximum likelihood estimate; modern continuous speech recognition technique; musical instrument digital interface; performed music; rhythm vector; stochastic grammar; stochastic model; tempo estimation; tempo free relative durations; time measure recognition; Automatic speech recognition; Hidden Markov models; Instruments; Maximum likelihood estimation; Multiple signal classification; Rhythm; Speech recognition; Stochastic processes; Time measurement; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing, 2002 IEEE Workshop on
  • Print_ISBN
    0-7803-7713-3
  • Type

    conf

  • DOI
    10.1109/MMSP.2002.1203337
  • Filename
    1203337