Title :
A Deep Web Complex Matching Method Based on Association Mining and Semantic Clustering
Author :
Yang Xiaoqin ; Shiguang, Ju ; Qinghuang, Cao
Author_Institution :
Coll. of Comput., Jiangsu Univ., Zhenjiang, China
Abstract :
In order to improve the efficiency and accuracy of deep Web interface matching, a method based on the existing dual correlation mining (DCM) method using association mining and semantic clustering was presented in this paper. While digging group attributes by using correlation algorithm, a new correlation measure based on mutual information was introduced and realized by matrix to resolve the inefficiency problem. The attributes were clustered to synonymous attributes by their similarity which was computed by using semantic net. By the compare on more than 200 interfaces in 4 domains, the experiment results indicate that the improved method has greatly heighted than DCM in the respect of efficiency and accuracy.
Keywords :
Internet; data mining; association mining; deep Web complex matching method; deep Web interface matching; dual correlation mining; mutual information; semantic clustering; Application software; Clustering algorithms; Computer interfaces; Databases; Educational institutions; Electronic mail; Information systems; Matrix converters; Mutual information; Pattern matching; Clustering; Complex Matching; Deep Web;
Conference_Titel :
Web Information Systems and Applications Conference, 2009. WISA 2009. Sixth
Conference_Location :
Xuzhou, Jiangsu
Print_ISBN :
978-0-7695-3874-7
DOI :
10.1109/WISA.2009.17