• DocumentCode
    35496
  • Title

    Spatial Source Subtraction Based on Incomplete Measurements of Relative Transfer Function

  • Author

    Koldovsky, Zbynek ; Malek, Jiri ; Gannot, Sharon

  • Author_Institution
    Fac. of Mechatron., Inf., & Interdiscipl. Studies, Tech. Univ. of Liberec, Liberec, Czech Republic
  • Volume
    23
  • Issue
    8
  • fYear
    2015
  • fDate
    Aug. 2015
  • Firstpage
    1335
  • Lastpage
    1347
  • Abstract
    Relative impulse responses between microphones are usually long and dense due to the reverberant acoustic environment. Estimating them from short and noisy recordings poses a long-standing challenge of audio signal processing. In this paper, we apply a novel strategy based on ideas of compressed sensing. Relative transfer function (RTF) corresponding to the relative impulse response can often be estimated accurately from noisy data but only for certain frequencies. This means that often only an incomplete measurement of the RTF is available. A complete RTF estimate can be obtained through finding its sparsest representation in the time-domain: that is, through computing the sparsest among the corresponding relative impulse responses. Based on this approach, we propose to estimate the RTF from noisy data in three steps. First, the RTF is estimated using any conventional method such as the nonstationarity-based estimator by Gannot or through blind source separation. Second, frequencies are determined for which the RTF estimate appears to be accurate. Third, the RTF is reconstructed through solving a weighted l1 convex program, which we propose to solve via a computationally efficient variant of the SpaRSA (Sparse Reconstruction by Separable Approximation) algorithm. An extensive experimental study with real-world recordings has been conducted. It has been shown that the proposed method is capable of improving many conventional estimators used as the first step in most situations.
  • Keywords
    audio signal processing; blind source separation; compressed sensing; convex programming; microphones; signal reconstruction; signal representation; time-domain analysis; transfer functions; RTF reconstruction; SpaRSA; audio signal processing; blind source separation; compressed sensing; microphone; relative impulse response; relative transfer function incomplete measurement; reverberant acoustic environment; sparse reconstruction by separable approximation algorithm; spatial source subtraction; time-domain; weighted l1 convex program; Approximation methods; Frequency estimation; Frequency-domain analysis; Microphones; Noise; Noise measurement; Noise reduction; ${ell _1}$ norm; Compressed sensing; relative impulse response; relative transfer function (RTF); sparse approximations;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2425213
  • Filename
    7090956