DocumentCode :
1902125
Title :
Concept Capture Based On Column Matching and Clustering
Author :
Zhou, Jingtao ; Wang, Mingwei ; Zhao, Han ; Zhang, Shusheng ; Zhang, Chao
Author_Institution :
Key Lab. of Contemporary Design & Integrated Manuf. Technol., Northwestern Polytech. Univ., Xi´´an
fYear :
2005
fDate :
27-29 Nov. 2005
Firstpage :
71
Lastpage :
71
Abstract :
Building ontology from scratch need identify the basic concepts of application domain. In terms of database integration, draft concepts can be directly captured by processing schemas of databases. In this context, we present an automatic approach based on matching and clustering of relational schema columns to capture concepts from relative databases. By combining three individual name matchers following a composite way, the matching phase computes the similarity between column names, which will be used as classifiers for clustering. A neural network matcher is proposed in clustering phase to categorize columns of schemas into clusters by using column constraints with the results from matching phase for joint consideration of multiple criteria. Finally, each concept is defined as a cluster of columns representing the same meaning. The concepts discovered by our approach can be used as draft material or seeds for further comprehensive concept capture.
Keywords :
neural nets; pattern clustering; pattern matching; relational databases; column clustering; column matching; concept capture; database integration; neural network matcher; relational schema columns; relative databases; Buildings; Chaos; Clustering algorithms; Educational technology; Laboratories; Manufacturing; Neural networks; Ontologies; Relational databases; Tin;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantics, Knowledge and Grid, 2005. SKG '05. First International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7695-2534-2
Electronic_ISBN :
0-7695-2534-2
Type :
conf
DOI :
10.1109/SKG.2005.52
Filename :
4125859
Link To Document :
بازگشت