• DocumentCode
    2149658
  • Title

    Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model

  • Author

    Maezawa, Akira ; Okuno, Hiroshi G. ; Ogata, Tetsuya ; Goto, Masataka

  • Author_Institution
    Dept. of Intell. Sci. & Technol., Kyoto Univ., Kyoto, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    185
  • Lastpage
    188
  • Abstract
    This paper presents a Bayesian method for temporally aligning a music score and an audio rendition. A critical problem in audio-to-score alignment is in dealing with the wide variety of timbre and volume of the audio rendition. In contrast with existing works that achieve this through ad-hoc feature design or careful training of tone models, we propose a Bayesian audio-to-score alignment method by modeling music performance as a Bayesian Hidden Markov Model, each state of which emits a Bayesian signal model based on Latent Harmonic Allocation. After attenuating reverberation, variational Bayes method is used to iteratively adapt the alignment, instrument tone model and the volume balance at each position of the score. The method is evaluated using sixty works of classical music of a variety of instrumentation ranging from solo piano to full orchestra. We verify that our method improves the alignment accuracy compared to dynamic time warping based on chroma vector for orchestral music, or our method employed in a maximum likelihood setting.
  • Keywords
    audio signal processing; hidden Markov models; music; variational techniques; Bayesian hidden Markov model; Bayesian latent harmonic allocation; Bayesian signal model; audio rendition; chroma vector; dynamic time warping; instrument tone model; music score; orchestral music; polyphonic audio-to-score alignment; variational Bayes method; Adaptation models; Bayesian methods; Harmonic analysis; Hidden Markov models; Instruments; Timbre; Audio-to-score alignment; Variational Bayes inference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946371
  • Filename
    5946371