DocumentCode :
1468751
Title :
Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model
Author :
Koretz, Amitai ; Tabrikian, Joseph
Author_Institution :
Dept. of Electr. & Comput. Eng., Ben Gurion Univ. of the Negev, Beer-Sheva, Israel
Volume :
19
Issue :
7
fYear :
2011
Firstpage :
2210
Lastpage :
2221
Abstract :
In this paper, a new method for multiple fundamental frequency estimation for speech and music signals is proposed. Applications of audio and speech processing include many well-reviewed algorithms for estimating the fundamental frequency of monophonic speech and music signals. In the case of polyphonic signals, it is more difficult to successfully estimate each of the fundamental frequencies, as reflected by the dearth of existing methods addressing this problem. In this paper, a new method based on the combination of the maximum likelihood and maximum a posteriori probability criteria is derived for fundamental frequencies tracking where each one of the fundamental frequencies is modeled by a first-order Markov process. The dominant signal is modeled as a harmonic source with unknown deterministic amplitudes, while the remaining signals, including other harmonic signals, are modeled as Gaussian interference sources with an unknown covariance matrix. After estimation of the dominant source, it is removed from the signal by projection of the signal into the null subspace spanned by the estimated signal. This procedure is iterated for all the harmonic sources in the data. The algorithm is tested with speech, music, and synthetic signals where in each case, two harmonic sources of the same kind were mixed. The performance of the proposed algorithm is evaluated and compared to an existing reference method in terms of gross-error-rate as a function of signal-to-interference ratio.
Keywords :
Markov processes; audio signal processing; frequency estimation; interference (signal); maximum likelihood estimation; speech processing; Gaussian interference sources; audio processing; covariance matrix; dominant source estimation; first-order Markov process; gross-error-rate; harmonic model; maximum a posteriori probability criteria; maximum a posteriori probability multiple-pitch tracking; maximum likelihood; monophonic speech; multiple fundamental frequency estimation; music signals; polyphonic signals; signal-to-interference ratio function; speech processing; speech signals; Algorithm design and analysis; Estimation; Frequency estimation; Harmonic analysis; Interference; Speech; Speech processing; F0 estimation; harmonic model; maximum a posteriori probability (MAP); multipitch estimation; multiple pitch estimation; pitch tracking;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2125952
Filename :
5728848
Link To Document :
بازگشت