DocumentCode
940440
Title
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
Author
Klapuri, Anssi
Author_Institution
Inst. of Signal Process., Tampere Univ. of Technol., Tampere
Volume
16
Issue
2
fYear
2008
Firstpage
255
Lastpage
266
Abstract
A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a computational model of the human auditory periphery, followed by a periodicity analysis mechanism where fundamental frequencies are iteratively detected and canceled from the mixture signal. The auditory model needs to be computed only once, and a computationally efficient strategy is proposed for implementing it. Simulation experiments were made using mixtures of musical sounds and mixed speech utterances. The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference. Different system configurations were studied to identify the conditions where pitch analysis using an auditory model is advantageous over conventional time or frequency domain approaches.
Keywords
audio signal processing; frequency estimation; iterative methods; music; physiological models; signal detection; speech processing; auditory model; computational model; frequency estimation; human auditory periphery; iterative detection; multipitch analysis; multiple-speaker speech signals; periodicity analysis mechanism; polyphonic music; speech signal; Acoustic signal analysis; fundamental frequency estimation; music information retrieval; pitch perception;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2007.908129
Filename
4358092
Link To Document