• DocumentCode
    730086
  • Title

    Audio source separation using a redundant library of source spectral bases for non-negative tensor factorization

  • Author

    Fakhry, Mahmoud ; Svaizer, Piergiorgio ; Omologo, Maurizio

  • Author_Institution
    Univ. of Trento, Trento, Italy
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    251
  • Lastpage
    255
  • Abstract
    This work proposes a solution to the problem of under-determined audio source separation using pre-trained redundant source-based prior information. In local Gaussian modeling of a mixing process, an observed mixture is modeled by a Gaussian distribution parameterized by source variances and spatial covariance matrices. The separation is performed by estimating the parameters, and applying Wiener filtering on the observed mixture. We propose, in a training phase, to build a redundant library of spectral basis matrices of all probable source power spectra, applying non-negative tensor factorization (NTF). In the testing phase, the matrices that match the observed mixture are detected using NTF. With the help of the detected matrices, a maximum likelihood algorithm is proposed in order to iteratively estimate the parameters of the model, exploiting the spatial redundancy of the observed mixture and using NTF. The proposed algorithm proves more flexibility and efficiency with respect to a baseline algorithm used as a reference.
  • Keywords
    Gaussian distribution; Wiener filters; audio signal processing; covariance matrices; matrix decomposition; maximum likelihood detection; source separation; sparse matrices; spectral analysis; Gaussian distribution; Gaussian modeling; NTF; Wiener filtering; audio source separation; maximum likelihood algorithm; nonnegative tensor factorization; power spectra; redundant library; source spectral bases; spatial covariance matrices; spectral basis matrices; Covariance matrices; Libraries; Matrix decomposition; Microphones; Source separation; Speech; Tensile stress; Spectral bases; audio source separation; model parameters; non-negative tensor factorization; redundant library;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177970
  • Filename
    7177970