DocumentCode
1626160
Title
WebIQ: Learning from the Web to Match Deep-Web Query Interfaces
Author
Wu, Wensheng ; Doan, AnHai ; Yu, Clement
Author_Institution
University of Illinois, Urbana
fYear
2006
Firstpage
44
Lastpage
44
Abstract
Integrating Deep Web sources requires highly accurate semantic matches between the attributes of the source query interfaces. These matches are usually established by comparing the similarities of the attributes’ labels and instances. However, attributes on query interfaces often have no or very few data instances. The pervasive lack of instances seriously reduces the accuracy of current matching techniques. To address this problem, we describe WebIQ, a solution that learns from both the Surface Web and the Deep Web to automatically discover instances for interface attributes. WebIQ extends question answering techniques commonly used in the AI community for this purpose. We describe how to incorporate WebIQ into current interface matching systems. Extensive experiments over five realworld domains show the utility ofWebIQ. In particular, the results show that acquired instances help improve matching accuracy from 89.5% F-1 to 97.5%, at only a modest runtime overhead.
Keywords
Artificial intelligence; Books; Cities and towns; Databases; Motion pictures; Runtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2006. ICDE '06. Proceedings of the 22nd International Conference on
Print_ISBN
0-7695-2570-9
Type
conf
DOI
10.1109/ICDE.2006.172
Filename
1617412
Link To Document