• DocumentCode
    2699436
  • Title

    Insights from Viewing Ranked Retrieval as Rank Aggregation

  • Author

    Bast, Holger ; Weber, Ingmar

  • Author_Institution
    Max-Planck-Inst. fur Informatik, Saarbrucken
  • fYear
    2005
  • fDate
    8-9 April 2005
  • Firstpage
    232
  • Lastpage
    239
  • Abstract
    We view a variety of established methods for ranked retrieval from a common angle, namely as a process of combining query-independent rankings that were precomputed for certain attributes. Apart from a general insight into what effectively distinguishes various schemes from each other, we obtain three specific results concerned with concept-based retrieval. First, we prove that latent semantic indexing (LSI) can be implemented to answer queries in time proportional to the number of words in the query, which improves over the standard implementation by an order of magnitude; a similar result is established for LSI´s probabilistic sibling PLSI. Second, we give a simple and precise characterization of the extent, to which latent semantic indexing (LSI) can deal with polysems, and when it fails to do so. Third, we demonstrate that the recombination of the intricate, yet relatively cheap mechanism of PLSI for mapping queries to attributes, with a simplistic, easy-to-compute set of document rankings gives a retrieval performance which is at least as good as that of the most sophisticated concept-based retrieval schemes and which does not require any precomputation
  • Keywords
    content-based retrieval; information retrieval; concept-based retrieval; document rankings; latent semantic indexing; probabilistic sibling PLSI; query mapping; query-independent rankings; rank aggregation; ranked retrieval; Conferences; Frequency; Humans; Indexing; Information retrieval; Large scale integration; Ontologies; Vectors; Web search; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
  • Conference_Location
    Tokyo
  • Print_ISBN
    0-7695-2414-1
  • Type

    conf

  • DOI
    10.1109/WIRI.2005.19
  • Filename
    1553019