مرکز منطقه ای اطلاع رساني علوم و فناوري - Fuzzy c-means clustering of incomplete data

DocumentCode :

1539705

Title :

Fuzzy c-means clustering of incomplete data

Author :

Hathaway, Richard J. ; Bezdek, James C.

Author_Institution :

Math. & Comput. Sci. Dept., Georgia Southern Univ., Statesboro, GA, USA

Volume :

Issue :

fYear :

2001

fDate :

10/1/2001 12:00:00 AM

Firstpage :

735

Lastpage :

744

Abstract :

The problem of clustering a real s-dimensional data set X={x₁,…,x_n} ⊂ R^s is considered. Usually, each observation (or datum) consists of numerical values for all s features (such as height, length, etc.), but sometimes data sets can contain vectors that are missing one or more of the feature values. For example, a particular datum x_k might be incomplete, having the form x_k=(254.3, ?, 333.2, 47.45, ?)^T, where the second and fifth feature values are missing. The fuzzy c-means (FCM) algorithm is a useful tool for clustering real s-dimensional data, but it is not directly applicable to the case of incomplete data. Four strategies for doing FCM clustering of incomplete data sets are given, three of which involve modified versions of the FCM algorithm. Numerical convergence properties of the new algorithms are discussed, and all approaches are tested using real and artificially generated incomplete data sets

Keywords :

fuzzy logic; pattern clustering; convergence properties; fuzzy c-means clustering; incomplete data; real s-dimensional data set; Clustering algorithms; Computer science; Convergence of numerical methods; Fuzzy logic; Mathematics; Maximum likelihood estimation; Parameter estimation; Pattern recognition; Performance analysis; Testing;

fLanguage :

English

Journal_Title :

Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on

Publisher :

ieee

ISSN :

1083-4419

Type :

jour

DOI :

10.1109/3477.956035

Filename :

956035

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1539705