Title :
A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models
Author_Institution :
PRESTO, Japan Sci. & Technol. Corp., Tsukuba, Japan
Abstract :
This paper describes a predominant-F0 (fundamental frequency) estimation method called PreFEst, which can detect melody and bass lines in monaural audio signals containing sounds of various instruments, While most previous methods premised mixtures of a few sounds and had difficulty dealing with such complex signals, our method can estimate the F0 of the melody and bass lines without assuming the number of sound sources in compact-disc recordings. In this paper we propose the following three extensions to our previous PreFEst to make it more adaptive and flexible: introducing multiple harmonic-structure tone models, estimating the shape of tone models, and introducing a prior distribution of its shape and F0 estimates These extensions were implemented by the MAP (maximum a posteriori probability) estimation by using the expectation-maximization algorithm. Experimental results with compact-disc recordings showed that our real-time system based on the extended PreFEst achieved performance improvement
Keywords :
audio signal processing; frequency estimation; iterative methods; maximum likelihood estimation; music; CD recordings; EM algorithm; MAP estimation; PreFEst; adaptive tone models; bass lines; compact-disc recordings; expectation-maximization algorithm; fundamental frequency estimation method; maximum a posteriori probability; melody; monaural audio signals; multiple harmonic-structure tone models; predominant-F0 estimation method; prior distribution; real-time system; tone models shape; Adaptive signal detection; Audio recording; CD recording; Disk recording; Frequency estimation; Humans; Instruments; Laboratories; Real time systems; Shape;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940380