• DocumentCode
    2087627
  • Title

    Finding themes in Medline documents - probabilistic similarity search

  • Author

    Shatkey, H. ; Wilber, W.J.

  • Author_Institution
    Nat. Cancer for Biotechnol. Inf., Nat. Inst. of Health, Bethesda, MD, USA
  • fYear
    2000
  • fDate
    24-24 May 2000
  • Firstpage
    183
  • Lastpage
    192
  • Abstract
    Large on-line document databases, such as Medine, pose a major challenge of retrieving the few documents most relevant to the user´s needs, while multimizing the return rate of nonrelevant documents. Retrieval of documents similar to a user provided example document is a promising query paradigm towards meeting this goal. We present a new theme-based probabilistic approach for finding documents relevant to a given query document, and summarizing their contents. Preliminary experiments conducted over a subset of Medline documents related to AIDS demonstrate the effectiveness of our approach.
  • Keywords
    information resources; information retrieval; medical information systems; AIDS; Medline documents; document retrieval; large on-line document databases; probabilistic similarity search; query document; query paradigm; theme-based probabilistic approach; Abstracts; Acquired immune deficiency syndrome; Biotechnology; Databases; Electrical capacitance tomography; Human immunodeficiency virus; Libraries; Natural language processing; Thesauri; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Digital Libraries, 2000. Proceedings. IEEE
  • Conference_Location
    Washington, DC, USA
  • Print_ISBN
    0-7695-0659-3
  • Type

    conf

  • DOI
    10.1109/ADL.2000.848381
  • Filename
    848381