DocumentCode :
773396
Title :
An integrated framework for visualized and exploratory pattern discovery in mixed data
Author :
Hsu, Chung-Chian ; Wang, Sheng-Hsuan
Author_Institution :
Dept. of Inf. Manage., Nat. Yunlin Univ. of Sci. & Technol., Taiwan
Volume :
18
Issue :
2
fYear :
2006
Firstpage :
161
Lastpage :
173
Abstract :
Data mining uncovers hidden, previously unknown, and potentially useful information from large amounts of data. Compared to the traditional statistical and machine learning data analysis techniques, data mining emphasizes providing a convenient and complete environment for the data analysis. In this paper, we propose an integrated framework for visualized, exploratory data clustering, and pattern extraction from mixed data. We further discuss its implementation techniques: a generalized self-organizing map (GSOM) and an extended attribute-oriented induction (EAOI), which not only overcome the drawbacks of their original algorithms, but also provide additional analysis capabilities. Specifically, the GSOM facilitates the direct handling of mixed data, including categorical and numeric values. The EAOI enables exploration for major values hidden in the data and, in addition, offers an alternative for processing numeric attributes, instead of generalizing them. A prototype was developed for experiments with synthetic and real data sets, and comparison with those of the traditional approaches. The results confirmed the feasibility of the framework and the superiority of the extended techniques.
Keywords :
data analysis; data mining; data visualisation; pattern clustering; very large databases; EAOI; GSOM; data mining; exploratory data clustering; exploratory pattern discovery; extended attribute-oriented induction; generalized self-organizing map; machine learning data analysis technique; mixed data analysis; pattern extraction; statistical data analysis technique; visualized data clustering; visualized pattern discovery; Algorithm design and analysis; Artificial intelligence; Clustering algorithms; Data analysis; Data mining; Data visualization; Machine learning; Prototypes; Statistical analysis; Visual databases; Index Terms- Attribute-oriented induction; clustering; data mining; pattern discovery; self-organizing map.;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2006.23
Filename :
1563980
Link To Document :
بازگشت