• DocumentCode
    3145490
  • Title

    Anti-serendipity: finding useless documents and similar documents

  • Author

    Cooper, James W. ; Prager, John M.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2000
  • fDate
    4-7 Jan. 2000
  • Abstract
    The problem of finding your way through a relatively unknown collection of digital documents can be daunting. Such collections sometimes have few categories and little hierarchy, or they have so much hierarchy that valuable relations between documents can easily become obscured. We describe here how our work in the area of term-recognition and sentence-based summarization can be used to filter the document lists that we return from searches. We can thus remove or downgrade the ranking of some documents that have limited utility even though they may match many of the search terms fairly accurately. We also describe how we can use this same system to find documents that are closely related to a document of interest, thus continuing our work to provide tools for query-free searching.
  • Keywords
    information retrieval; anti-serendipity; query-free searching; sentence-based summarization; similar documents; term-recognition; useless documents; Computer interfaces; Displays; Feedback; Filters; Performance analysis; Search engines; Statistics; Text analysis; Thesauri;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 2000. Proceedings of the 33rd Annual Hawaii International Conference on
  • Print_ISBN
    0-7695-0493-0
  • Type

    conf

  • DOI
    10.1109/HICSS.2000.926691
  • Filename
    926691