Title of article :
On the complexity of Rocchioʹs similarity-based relevance feedback algorithm
Author/Authors :
Zhixiang Chen، نويسنده , , Bin Fu، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2007
Pages :
9
From page :
1392
To page :
1400
Abstract :
Rocchioʹs similarity-based relevance feedback algorithm, one of the most important query reformation methods in information retrieval, is essentially an adaptive learning algorithm from examples in searching for documents represented by a linear classifier. Despite its popularity in various applications, there is little rigorous analysis of its learning complexity in literature. In this article, the authors prove for the first time that the learning complexity of Rocchioʹs algorithm is O(d + d2(log d + log n)) over the discretized vector space {0,…, n − 1}d, when the inner product similarity measure is used. The upper bound on the learning complexity for searching for documents represented by a monotone linear classifier equation image over {0,…, n − 1}d can be improved to, at most, 1 + 2k (n − 1) (log d − log(n − 1)), where k is the number of nonzero components in q. Several lower bounds on the learning complexity are also obtained for Rocchioʹs algorithm. For example, the authors prove that Rocchioʹs algorithm has a lower bound equation image on its learning complexity over the Boolean vector space {0, 1}d.
Journal title :
Journal of the American Society for Information Science and Technology
Serial Year :
2007
Journal title :
Journal of the American Society for Information Science and Technology
Record number :
993557
Link To Document :
بازگشت