Title :
Audio stream segregation of multi-pitch music signal based on time-space clustering using Gaussian kernel 2-dimensional model
Author :
Kameoka, Hirokazu ; Nishimoto, Takuya ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Japan
Abstract :
The paper describes a novel approach for audio stream segregation of a multi-pitch music signal. We propose a parameter-constrained time-frequency spectrum model that expresses both the harmonic spectral structure and the temporal curve of the power envelope with Gaussian kernels. MAP estimation of the model parameters using the EM algorithm yields the fundamental frequency, onset and offset times, spectral envelope, and power envelope of every underlying audio stream. The proposed method showed high accuracy in a pitch name estimation task on several pieces of real music performance data.
Keywords :
Gaussian processes; audio signal processing; maximum likelihood estimation; music; optimisation; time-frequency analysis; EM algorithm; Gaussian kernel 2-dimensional model; MAP estimation; audio signal analysis; audio stream segregation; fundamental frequency; harmonic spectral structure; multi-pitch music signal; offset time; onset time; pitch name estimation; power envelope temporal curve; spectral envelope; time-frequency spectrum model; time-space clustering; Frequency estimation; Information science; Kernel; Multiple signal classification; Power harmonic filters; Power system harmonics; Signal analysis; Solid modeling; Streaming media; Time frequency analysis
Conference_Title :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1415632