• DocumentCode
    3510270
  • Title

    Blind sparse source separation for unknown number of sources using Gaussian mixture model fitting with Dirichlet prior

  • Author

    Araki, Shoko ; Nakatani, Tomohiro ; Sawada, Hiroshi ; Makino, Shoji

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    33
  • Lastpage
    36
  • Abstract
    In this paper, we propose a novel sparse source separation method that can be applied even if the number of sources is unknown. Recently, many sparse source separation approaches with time-frequency masks have been proposed. However, most of these approaches require information on the number of sources in advance. In our proposed method, we model the histogram of the estimated direction of arrival (DOA) with a Gaussian mixture model (GMM) with a Dirichlet prior. Then we estimate the model parameters by using the maximum a posteriori estimation based on the EM algorithm. In order to avoid one cluster being modeled by two or more Gaussians, we utilize a sparse distribution modeled by the Dirichlet distributions as the prior of the GMM mixture weight. By using this prior, without any specific model selection process, our proposed method can estimate the number of sources and time-frequency masks simultaneously. Experimental results show the performance of our proposed method.
  • Keywords
    Gaussian distribution; blind source separation; direction-of-arrival estimation; expectation-maximisation algorithm; time-frequency analysis; Dirichlet distribution; EM algorithm; GMM mixture weight; Gaussian mixture model fitting; blind sparse source separation; direction-of-arrival estimation; histogram; maximum a posteriori estimation; time-frequency masks; Blind source separation; Clustering algorithms; Direction of arrival estimation; Histograms; Independent component analysis; Maximum a posteriori estimation; Parameter estimation; Source separation; Speech; Time frequency analysis; Blind source separation; Dirichlet distribution; number of sources; prior; sparse;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959513
  • Filename
    4959513