Title :
Utilizing Fused Features to Mine Unknown Clusters in Training Data
Author :
Lynch, Robert S., Jr. ; Willett, Peter K.
Author_Institution :
Signal Process. Branch, Naval Undersea Warfare Center, Newport, RI
Abstract :
In this paper, a previously introduced data mining technique, utilizing the mean field Bayesian data reduction algorithm (BDRA), is extended for use in finding unknown data clusters in a fused multidimensional feature space. In the BDRA the modeling assumption is that the discrete symbol probabilities of each class are a priori uniformly Dirichlet distributed, and where the primary metric for selecting and discretizing all relevant features is an analytic formula for the probability of error conditioned on the training data. In extending the BDRA for this application, notice that its built-in dimensionality reduction aspects are exploited for isolating and automatically sorting out and mining all points contained in each unknown data cluster. To illustrate performance, results are demonstrated using simulated data containing multiple clusters, and where the fused feature space contains relevant classification information
Keywords :
data mining; error statistics; pattern classification; pattern clustering; probability; sensor fusion; Bayesian data reduction algorithm; Dirichlet distribution; classification information; data mining technique; discrete symbol probability; error probability; fused multidimensional feature space; mean field BDRA; mine unknown cluster; primary metric; training data; Bayesian methods; Clustering algorithms; Data mining; Investments; Multidimensional signal processing; Signal processing algorithms; Sorting; Target recognition; Testing; Training data; Adaptive classification; Discrete data; Level two fusion; Unknown data distribution;
Conference_Titel :
Information Fusion, 2006 9th International Conference on
Conference_Location :
Florence
Print_ISBN :
1-4244-0953-5
Electronic_ISBN :
0-9721844-6-5
DOI :
10.1109/ICIF.2006.301761