DocumentCode
1468751
Title
Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model
Author
Koretz, Amitai ; Tabrikian, Joseph
Author_Institution
Dept. of Electr. & Comput. Eng., Ben Gurion Univ. of the Negev, Beer-Sheva, Israel
Volume
19
Issue
7
fYear
2011
Firstpage
2210
Lastpage
2221
Abstract
In this paper, a new method for multiple fundamental frequency estimation for speech and music signals is proposed. Applications of audio and speech processing include many well-reviewed algorithms for estimating the fundamental frequency of monophonic speech and music signals. In the case of polyphonic signals, it is more difficult to successfully estimate each of the fundamental frequencies, as reflected by the dearth of existing methods addressing this problem. In this paper, a new method based on the combination of the maximum likelihood and maximum a posteriori probability criteria is derived for fundamental frequencies tracking where each one of the fundamental frequencies is modeled by a first-order Markov process. The dominant signal is modeled as a harmonic source with unknown deterministic amplitudes, while the remaining signals, including other harmonic signals, are modeled as Gaussian interference sources with an unknown covariance matrix. After estimation of the dominant source, it is removed from the signal by projection of the signal into the null subspace spanned by the estimated signal. This procedure is iterated for all the harmonic sources in the data. The algorithm is tested with speech, music, and synthetic signals where in each case, two harmonic sources of the same kind were mixed. The performance of the proposed algorithm is evaluated and compared to an existing reference method in terms of gross-error-rate as a function of signal-to-interference ratio.
Keywords
Markov processes; audio signal processing; frequency estimation; interference (signal); maximum likelihood estimation; speech processing; Gaussian interference sources; audio processing; covariance matrix; dominant source estimation; first-order Markov process; gross-error-rate; harmonic model; maximum a posteriori probability criteria; maximum a posteriori probability multiple-pitch tracking; maximum likelihood; monophonic speech; multiple fundamental frequency estimation; music signals; polyphonic signals; signal-to-interference ratio function; speech processing; speech signals; Algorithm design and analysis; Estimation; Frequency estimation; Harmonic analysis; Interference; Speech; Speech processing; F0 estimation; harmonic model; maximum a posteriori probability (MAP); multipitch estimation; multiple pitch estimation; pitch tracking;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2125952
Filename
5728848
Link To Document