• DocumentCode
    2490711
  • Title

    Robust mixture modeling using the Pearson type VII distribution

  • Author

    Sun, Jianyong ; Kabán, Ata ; Garibaldi, Jonathan M.

  • Author_Institution
    Centre for Plant Integrative Biol., Univ. of Nottingham, Nottingham, UK
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    A mixture of Student t-distributions (MoT) has been widely used to model multivariate data sets with atypical observations, or outliers for robust clustering. In this paper, we developed a novel robust clustering approach by modeling the data sets using mixture of Pearson type VII distributions (MoP). An EM algorithm is developed for the maximum likelihood estimation of the model parameters. An outlier detection criterion is derived from the EM solution. Controlled experimental results on the synthetic datasets show that the MoP is more viable than the MoT. The MoP performs comparably if not better, on average, in terms of outlier detection accuracy and out-of-sample log-likelihood with the MoT. Furthermore, we compared the performances of the Pearson type VII and the student t mixtures on the classification of several benchmark pattern recognition data sets. The comparison favours the developed Pearson type VII mixtures.
  • Keywords
    expectation-maximisation algorithm; statistical analysis; EM algorithm; MoP; MoT; Pearson type VII distribution; maximum likelihood estimation; multivariate data sets; outlier detection criterion; robust clustering approach; robust mixture modeling; Data models; Equations; Maximum likelihood estimation; Robustness; Testing; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2010 International Joint Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-6916-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2010.5596560
  • Filename
    5596560