Title :
Concept Capture Based On Column Matching and Clustering
Author :
Zhou, Jingtao ; Wang, Mingwei ; Zhao, Han ; Zhang, Shusheng ; Zhang, Chao
Author_Institution :
Key Lab. of Contemporary Design & Integrated Manuf. Technol., Northwestern Polytech. Univ., Xi´´an
Abstract :
Building ontology from scratch need identify the basic concepts of application domain. In terms of database integration, draft concepts can be directly captured by processing schemas of databases. In this context, we present an automatic approach based on matching and clustering of relational schema columns to capture concepts from relative databases. By combining three individual name matchers following a composite way, the matching phase computes the similarity between column names, which will be used as classifiers for clustering. A neural network matcher is proposed in clustering phase to categorize columns of schemas into clusters by using column constraints with the results from matching phase for joint consideration of multiple criteria. Finally, each concept is defined as a cluster of columns representing the same meaning. The concepts discovered by our approach can be used as draft material or seeds for further comprehensive concept capture.
Keywords :
neural nets; pattern clustering; pattern matching; relational databases; column clustering; column matching; concept capture; database integration; neural network matcher; relational schema columns; relative databases; Buildings; Chaos; Clustering algorithms; Educational technology; Laboratories; Manufacturing; Neural networks; Ontologies; Relational databases; Tin;
Conference_Titel :
Semantics, Knowledge and Grid, 2005. SKG '05. First International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7695-2534-2
Electronic_ISBN :
0-7695-2534-2
DOI :
10.1109/SKG.2005.52