DocumentCode :
2456268
Title :
Entity Search Strategies for Mashup Applications
Author :
Endrullis, Stefan ; Thor, Andreas ; Rahm, Erhard
Author_Institution :
Univ. of Leipzig, Leipzig, Germany
fYear :
2012
fDate :
1-5 April 2012
Firstpage :
66
Lastpage :
77
Abstract :
Programmatic data integration approaches such as mashups have become a viable approach to dynamically integrate web data at runtime. Key data sources for mashups include entity search engines and hidden databases that need to be queried via source-specific search interfaces or web forms. Current mashups are typically restricted to simple query approaches such as using keyword search. Such approaches may need a high number of queries if many objects have to be found. Furthermore, the effectiveness of the queries may be limited, i.e., they may miss relevant results. We therefore propose more advanced search strategies that aim at finding a set of entities with high efficiency and high effectiveness. Our strategies use different kinds of queries that are determined by source-specific query generators. Furthermore, the queries are selected based on the characteristics of input entities. We introduce a flexible model for entity search strategies that includes a ranking of candidate queries determined by different query generators. We describe different query generators and outline their use within four entity search strategies. These strategies apply different query ranking and selection approaches to optimize efficiency and effectiveness. We evaluate our search strategies in detail for two domains: product search and publication search. The comparison with a standard keyword search shows that the proposed search strategies provide significant improvements in both domains.
Keywords :
Internet; data integration; optimisation; query formulation; search engines; Web forms; candidate queries; dynamical Web data integration; entity search engine; entity search strategies; flexible model; hidden database; key data sources; mashup applications; product search; programmatic data integration approaches; publication search; query ranking; query selection approach; source-specific query generators; source-specific search interfaces; standard keyword search; Books; Complexity theory; Conferences; Data engineering; Generators; Search engines; Search problems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
Conference_Location :
Washington, DC
ISSN :
1063-6382
Print_ISBN :
978-1-4673-0042-1
Type :
conf
DOI :
10.1109/ICDE.2012.84
Filename :
6228073
Link To Document :
بازگشت