DocumentCode :
179472
Title :
Audio part mixture alignment based on hierarchical nonparametric Bayesian model of musical audio sequence collection
Author :
Maezawa, Akira ; Okuno, Hiroshi G.
Author_Institution :
R&D Div., Yamaha Corp., Iwata, Japan
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
5212
Lastpage :
5216
Abstract :
This paper proposes “audio part mixture alignment,” a method for temporally aligning multiple audio signals, each of which is a rendition of a non-disjoint subset of a common piece of music. The method decomposes each audio signal into shared components and components unique to each rendition. At the same time, it aligns each audio signal based on the shared component. Decomposition of audio signal is modeled using a hierarchical Dirichlet process (Hierarchical DP, HDP), and sequence alignment is modeled as a left-to-right hidden Markov model (HMM). Variational Bayesian inference is used to jointly infer the alignment and component decomposition. The proposed method is compared with a classic audio-to-audio alignment method, and it is found that the proposed method is more robust to the discrepancy of parts between two audio signals.
Keywords :
Bayes methods; audio signal processing; hidden Markov models; music; nonparametric statistics; Bayesian inference; HDP; HMM; audio part mixture alignment; audio signals; audio-to-audio alignment method; component decomposition; hidden Markov model; hierarchical DP; hierarchical Dirichlet process; hierarchical nonparametric Bayesian model; musical audio sequence collection; nondisjoint subset; sequence alignment; Atomic measurements; Bayes methods; Hidden Markov models; Instruments; Music; Source separation; Spectrogram; Audio-audio alignment; Nonparametric hierarchical bayes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854597
Filename :
6854597
Link To Document :
بازگشت