• DocumentCode
    1493954
  • Title

    Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model

  • Author

    Duong, Ngoc Q K ; Vincent, Emmanuel ; Gribonval, Rémi

  • Author_Institution
    Centre Inria Rennes-Bretagne Atlantique, INRIA, Rennes, France
  • Volume
    18
  • Issue
    7
  • fYear
    2010
  • Firstpage
    1830
  • Lastpage
    1840
  • Abstract
    This paper addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covariance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation-maximization (EM) algorithms to estimate the parameters of each model and propose suitable procedures adapted from the state-of-the-art to initialize the parameters and to align the order of the estimated sources across all frequency bins. Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach.
  • Keywords
    audio signal processing; covariance analysis; source separation; audio source separation; blind source separation; covariance encodes; full-rank unconstrained model; iterative expectation-maximization algorithms; mixture channels; spatial characteristics; spatial covariance model; time frequency domain; underdetermined reverberant; zero-mean Gaussian random variable; Audio recording; Blind source separation; Context modeling; Frequency estimation; Iterative algorithms; Parameter estimation; Random variables; Source separation; State estimation; Time frequency analysis; Convolutive blind source separation (BSS); expectation–maximization (EM) algorithm; permutation problem; spatial covariance models; under-determined mixtures;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2050716
  • Filename
    5466223