• DocumentCode
    106508
  • Title

    Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain

  • Author

    Badeau, Roland ; Plumbley, Mark D.

  • Author_Institution
    Institut Mines-Telecom, Telecom ParisTech, CNRS LTCI, Paris, France
  • Volume
    22
  • Issue
    11
  • fYear
    2014
  • fDate
    Nov. 2014
  • Firstpage
    1670
  • Lastpage
    1680
  • Abstract
    Several probabilistic models involving latent components have been proposed for modeling time-frequency (TF) representations of audio signals such as spectrograms, notably in the nonnegative matrix factorization (NMF) literature. Among them, the recent high-resolution NMF (HR-NMF) model is able to take both phases and local correlations in each frequency band into account, and its potential has been illustrated in applications such as source separation and audio inpainting. In this paper, HR-NMF is extended to multichannel signals and to convolutive mixtures. The new model can represent a variety of stationary and non-stationary signals, including autoregressive moving average (ARMA) processes and mixtures of damped sinusoids. A fast variational expectation-maximization (EM) algorithm is proposed to estimate the enhanced model. This algorithm is applied to piano signals, and proves capable of accurately modeling reverberation, restoring missing observations, and separating pure tones with close frequencies.
  • Keywords
    Convolution; Equations; Hafnium; Mathematical model; Speech; Time-domain analysis; Time-frequency analysis; Multichannel signal analysis; non-stationary signal modeling; nonnegative matrix factorization; time-frequency analysis; variational EM algorithm;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2341920
  • Filename
    6862864