DocumentCode :
108879
Title :
Consensus-Based Ranking of Multivalued Objects: A Generalized Borda Count Approach
Author :
Ying Zhang ; Wenjie Zhang ; Jian Pei ; Xuemin Lin ; Qianlu Lin ; Aiping Li
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of New South Wales, Sydney, NSW, Australia
Volume :
26
Issue :
1
fYear :
2014
fDate :
Jan. 2014
Firstpage :
83
Lastpage :
96
Abstract :
In this paper, we tackle a novel problem of ranking multivalued objects, where an object has multiple instances in a multidimensional space, and the number of instances per object is not fixed. Given an ad hoc scoring function that assigns a score to a multidimensional instance, we want to rank a set of multivalued objects. Different from the existing models of ranking uncertain and probabilistic data, which model an object as a random variable and the instances of an object are assumed exclusive, we have to capture the coexistence of instances here. To tackle the problem, we advocate the semantics of favoring widely preferred objects instead of majority votes, which is widely used in many elections and competitions. Technically, we borrow the idea from Borda Count (BC), a well-recognized method in consensus-based voting systems. However, Borda Count cannot handle multivalued objects of inconsistent cardinality, and is costly to evaluate top (k) queries on large multidimensional data sets. To address the challenges, we extend and generalize Borda Count to quantile-based Borda Count, and develop efficient computational methods with comprehensive cost analysis. We present case studies on real data sets to demonstrate the effectiveness of the generalized Borda Count ranking, and use synthetic and real data sets to verify the efficiency of our computational method.
Keywords :
probability; query processing; random processes; BC; consensus-based ranking; consensus-based voting systems; cost analysis; generalized Borda count approach; hoc scoring function; large multidimensional data sets; multidimensional instance; multidimensional space; multivalued object problem; probabilistic data; quantile-based Borda count; random variable; ranking uncertain models; Biological system modeling; Cities and towns; Data models; Economics; Educational institutions; Indexes; Probabilistic logic; Multivalued objects; consensus-based ranking;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2012.250
Filename :
6399469
Link To Document :
بازگشت