Title :
A study on a fuzzy clustering for mixed numerical and categorical incomplete data
Author :
Furukawa, Toshihiro ; Ohnishi, Shin-ichi ; Yamanoi, Takashi
Author_Institution :
Grad. Sch. of Eng., Hokkai-Gakuen Univ., Sapporo, Japan
Abstract :
Most clustering methods focus on numerical data. However, most data existing in databases are both categorical and numerical. To date, clustering methods have been developed to analyze only complete data. Although we sometimes encounter data sets that contain one or more missing feature values (incomplete data), traditional clustering methods cannot be used for such data. Thus, we study this theme and discuss clustering methods that can handle mixed numerical and categorical incomplete data. In this paper, we propose an algorithm that uses the missing categorical data imputation method and distances between numerical data that contain missing values. Furthermore, we apply fuzzy clustering for interpreting results that are vague.
Keywords :
database theory; fuzzy set theory; numerical analysis; pattern clustering; categorical incomplete data; complete data; data sets; databases; fuzzy clustering; missing categorical data imputation method; missing feature values; mixed numerical data; Algorithm design and analysis; Clustering algorithms; Clustering methods; Conferences; Cybernetics; Databases; Educational institutions;
Conference_Titel :
Fuzzy Theory and Its Applications (iFUZZY), 2013 International Conference on
Conference_Location :
Taipei
DOI :
10.1109/iFuzzy.2013.6825477