Title :
Maximizing expected gain in supervised discrete Bayesian classification when fusing binary valued features
Author :
Lynch, Robert S., Jr. ; Willett, Peter K.
Author_Institution :
Signal Process. Branch, Naval Undersea Warfare Center, Newport, RI, USA
Abstract :
In this paper, previously reported work is extended for fusing binary valued features. In general, when mining discrete data to train supervised discrete Bayesian classifiers, it is often of interest to determine the best threshold setting for maximizing performance. In this work, we utilize a discrete Bayesian classification model, a gain function, to determine the best threshold setting for a given number of binary valued training data under each class. Results are demonstrated for simulated data by plotting the expected gain versus threshold settings for different numbers of training data. In general, it is shown that the expected gain reaches a maximum at a certain threshold. Further, this maximum point varies with the overall quantization of the data. Additional results are also shown for a different gain function on the decision variable, that are used to extend previously reported results.
Keywords :
belief networks; data mining; pattern classification; binary valued features; binary valued training data; data quantization; discrete data mining; gain function; supervised discrete Bayesian classification; Bayesian methods; Data mining; Data models; Quantization; Signal processing; Statistics; Supervised learning; Testing; Training data; Discrete binary data; Gain function; Noninformative prior; Unknown data distribution;
Conference_Titel :
Information Fusion, 2009. FUSION '09. 12th International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
978-0-9824-4380-4