DocumentCode :
431618
Title :
Audio stream segregation of multi-pitch music signal based on time-space clustering using Gaussian kernel 2-dimensional model
Author :
Kameoka, Hirokazu ; Nishimoto, Takuya ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Japan
Volume :
3
fYear :
2005
fDate :
18-23 March 2005
Abstract :
This paper describes a novel approach to audio stream segregation of multi-pitch music signals. We propose a parameter-constrained time-frequency spectrum model that uses Gaussian kernels to express both the harmonic spectral structure and the temporal curve of the power envelope. MAP estimation of the model parameters with the EM algorithm yields the fundamental frequency, onset and offset times, spectral envelope, and power envelope of every underlying audio stream. The proposed method showed high accuracy in a pitch name estimation task on several pieces of real music performance data.
Keywords :
Gaussian processes; audio signal processing; maximum likelihood estimation; music; optimisation; time-frequency analysis; EM algorithm; Gaussian kernel 2-dimensional model; MAP estimation; audio signal analysis; audio stream segregation; fundamental frequency; harmonic spectral structure; multi-pitch music signal; offset time; onset time; pitch name estimation; power envelope temporal curve; spectral envelope; time-frequency spectrum model; time-space clustering; Frequency estimation; Information science; Kernel; Multiple signal classification; Power harmonic filters; Power system harmonics; Signal analysis; Solid modeling; Streaming media; Time frequency analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1415632
Filename :
1415632