Title :
An algebraic approach to data mining: some examples
Author :
Grossman, Robert L. ; Larson, Richard G.
Author_Institution :
Lab. for Adv. Comput., Illinois Univ., Chicago, IL, USA
Abstract :
We introduce an algebraic approach to the foundations of data mining. Our approach is based upon two algebras of functions defined over a common state space X and a pairing between them. One algebra is an algebra of state space observations, and the other is an algebra of labeled sets of states. We interpret H as the algebraic encoding of the data and the pairing as the misclassification rate when the classifier f is applied to the set of states X. We give a realization theorem giving conditions on formal series of data sets built from D that imply there is a realization involving a state space X, a classifier f ∈ R and a set of labeled states χ ∈ R0 that yield this series.
Keywords :
algebra; data mining; database theory; pattern classification; very large databases; algebraic approach; algebraic encoding; classifier; data mining; data sets; functions; labeled sets of states; large database; misclassification rate; state space observations; Algebra; Control theory; Convergence; Data mining; Encoding; Erbium; Laboratories; Learning automata; Predictive models; State-space methods;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1184011