• DocumentCode
    940440
  • Title

    Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model

  • Author

    Klapuri, Anssi

  • Author_Institution
    Inst. of Signal Process., Tampere Univ. of Technol., Tampere
  • Volume
    16
  • Issue
    2
  • fYear
    2008
  • Firstpage
    255
  • Lastpage
    266
  • Abstract
    A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a computational model of the human auditory periphery, followed by a periodicity analysis mechanism where fundamental frequencies are iteratively detected and canceled from the mixture signal. The auditory model needs to be computed only once, and a computationally efficient strategy is proposed for implementing it. Simulation experiments were made using mixtures of musical sounds and mixed speech utterances. The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference. Different system configurations were studied to identify the conditions where pitch analysis using an auditory model is advantageous over conventional time or frequency domain approaches.
  • Keywords
    audio signal processing; frequency estimation; iterative methods; music; physiological models; signal detection; speech processing; auditory model; computational model; frequency estimation; human auditory periphery; iterative detection; multipitch analysis; multiple-speaker speech signals; periodicity analysis mechanism; polyphonic music; speech signal; Acoustic signal analysis; fundamental frequency estimation; music information retrieval; pitch perception;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.908129
  • Filename
    4358092