Title :
Leveraging relational autocorrelation with latent group models
Author :
Neville, Jennifer ; Jensen, David
Author_Institution :
Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
The presence of autocorrelation provides a strong motivation for using relational learning and inference techniques. Autocorrelation is a statistical dependence between the values of the same variable on related entities and is a nearly ubiquitous characteristic of relational data sets. Recent research has explored the use of collective inference techniques to exploit this phenomenon. These techniques achieve significant performance gains by modeling observed correlations among class labels of related instances, but the models fail to capture a frequent cause of autocorrelation - the presence of underlying groups that influence the attributes on a set of entities. We propose a latent group model (LGM) for relational data, which discovers and exploits the hidden structures responsible for the observed autocorrelation among class labels. Modeling the latent group structure improves model performance, increases inference efficiency, and enhances our understanding of the datasets. We evaluate performance on three relational classification tasks and show that LGM outperforms models that ignore latent group structure, particularly when there is little information with which to seed inference.
Keywords :
inference mechanisms; learning (artificial intelligence); relational databases; collective inference; latent group model; latent group models; latent group structure; relational autocorrelation; relational classification; relational data sets; relational learning; statistical dependence; Advertising; Autocorrelation; Computer science; Data mining; Graphical models; Motion pictures; Performance gain; Predictive models; Web pages; Web sites;
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
Print_ISBN :
0-7695-2278-5
DOI :
10.1109/ICDM.2005.89