DocumentCode
3322109
Title
Standing Out in a Crowd: Selecting Attributes for Maximum Visibility
Author
Miah, Muhammed ; Das, Gautam ; Hristidis, Vagelis ; Mannila, Heikki
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX
fYear
2008
fDate
7-12 April 2008
Firstpage
356
Lastpage
365
Abstract
In recent years, there has been significant interest in development of ranking functions and efficient top-k retrieval algorithms to help users in ad-hoc search and retrieval in databases (e.g., buyers searching for products in a catalog). In this paper we focus on a novel and complementary problem: how to guide a seller in selecting the best attributes of a new tuple (e.g., new product) to highlight such that it stands out in the crowd of existing competitive products and is widely visible to the pool of potential buyers. We develop several interesting formulations of this problem. Although these problems are NP-complete, we can give several exact algorithms as well as approximation heuristics that work well in practice. Our exact algorithms are based on integer programming (IP) formulations of the problems, as well as on adaptations of maximal frequent itemset mining algorithms, while our approximation algorithms are based on greedy heuristics. We conduct a performance study illustrating the benefits of our methods on real as well as synthetic data.
Keywords
data mining; greedy algorithms; information retrieval; integer programming; retail data processing; search engines; NP-complete problem; approximation algorithm; attribute selection; greedy heuristics; integer programming; maximal frequent itemset mining algorithm; ranking function; retail data processing; top-k retrieval algorithm; Algorithm design and analysis; Approximation algorithms; Automobile manufacture; Data mining; Information retrieval; Itemsets; Linear programming; Manufacturing; New products catalog; Relational databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
978-1-4244-1837-4
Type
conf
DOI
10.1109/ICDE.2008.4497444
Filename
4497444
Link To Document