Title :
A Bayesian Nonparametric Model for Joint Relation Integration and Domain Clustering
Author :
Li, Dazhuo ; Mohammad, Fahim ; Rouchka, Eric
Author_Institution :
Dept. of Comput. Eng. & Comput. Sci., Univ. of Louisville, Louisville, KY, USA
Abstract :
Relational databases provide unprecedented opportunities for knowledge discovery. Various approaches have been proposed to infer structures over entity types and predict relationships among elements of these types. However, discovering structures beyond the entity type level, e.g. clustering over relation concepts, remains a challenging task. We present a Bayesian nonparametric model for joint relation and domain clustering. The model can automatically infer the number of relation clusters, which is particularly important in novel cases where little prior knowledge is known about the number of relation clusters. The approach is applied to clustering various relations in a gene database.
Keywords :
Bayes methods; biology computing; data mining; genomics; pattern clustering; relational databases; Bayesian nonparametric model; domain clustering; gene database; joint relation integration; knowledge discovery; relational databases; Approximation methods; Bayesian methods; Databases; Joints; Machine learning; Proteins;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
DOI :
10.1109/ICMLA.2010.168