مرکز منطقه ای اطلاع رساني علوم و فناوري - A random finite set model for data clustering

DocumentCode :

263296

Title :

A random finite set model for data clustering

Author :

Dinh Phung ; Ba-Ngu Vo

Author_Institution :

Center for Pattern Recognition & Data Analytics, Deakin Univ., Geelong, VIC, Australia

fYear :

2014

fDate :

7-10 July 2014

Firstpage :

Lastpage :

Abstract :

The goal of data clustering is to partition data points into groups to optimize a given objective function. While most existing clustering algorithms treat each data point as vector, in many applications each datum is not a vector but a point pattern or a set of points. Moreover, many existing clustering methods require the user to specify the number of clusters, which is not available in advance. This paper proposes a new class of models for data clustering that addresses set-valued data as well as unknown number of clusters, using a Dirichlet Process mixture of Poisson random finite sets. We also develop an efficient Markov Chain Monte Carlo posterior inference technique that can learn the number of clusters and mixture parameters automatically from the data. Numerical studies are presented to demonstrate the salient features of this new model, in particular its capacity to discover extremely unbalanced clusters in data.

Keywords :

Markov processes; Monte Carlo methods; learning (artificial intelligence); pattern clustering; probability; Dirichlet process mixture; Markov Chain Monte Carlo posterior inference technique; Poisson random finite set model; data clustering; data point partitioning; learning; objective function optimization; point pattern; set-valued data; Bayes methods; Data models; Density measurement; Markov processes; Monte Carlo methods; Random variables; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Information Fusion (FUSION), 2014 17th International Conference on

Conference_Location :

Salamanca

Type :

conf

Filename :

6916264

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=263296