Title :
Multiple-Instance Regression with Structured Data
Author :
Wagstaff, Kiri L. ; Lane, Terran ; Roper, Alex
Author_Institution :
Jet Propulsion Lab., California Inst. of Technol., Pasadena, CA
Abstract :
We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents. Unlike previous MIR methods, MI-ClusterRegress can operate on bags that are structured in that they contain items drawn from a number of distinct (but unknown) distributions. MI-ClusterRegress simultaneously learns a model of the bagpsilas internal structure, the relevance of each item, and a regression model that accurately predicts labels for new bags. We evaluated this approach on the challenging MIR problem of crop yield prediction from remote sensing data. MI-ClusterRegress provided predictions that were more accurate than those obtained with non-multiple-instance approaches or MIR methods that do not model the bag structure.
Keywords :
pattern clustering; regression analysis; MI-ClusterRegress; MIR methods; bag labels; crop yield prediction; distinct distributions; internal bag structure; multiple instance regression; real-valued labels; regression model; remote sensing data; structured data; Conferences; Crops; Data mining; Drugs; Laboratories; MODIS; Predictive models; Propulsion; Remote sensing; US Department of Agriculture; crop yield prediction; multiple-instance regression; structured data;
Conference_Titel :
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3503-6
Electronic_ISBN :
978-0-7695-3503-6
DOI :
10.1109/ICDMW.2008.31