• DocumentCode
    2158215
  • Title

    A Sentence Classification System for Multi Biomedical Literature Summarization

  • Author

    Yamamoto, Yasunori ; Takagi, Toshihisa

  • Author_Institution
    University of Tokyo, Tokyo, Japan
  • fYear
    2005
  • fDate
    05-08 April 2005
  • Firstpage
    1163
  • Lastpage
    1163
  • Abstract
    A PubMed search often returns a long list of queryrelated papers that a researcher cannot cope with in a short time. As a first step to address this issue by summarizing retrieved papers, we developed a system to classify sentences of abstracts obtained from the MEDLINE database into five rhetorical statuses: background, purpose, method, result, or conclusion. We used Support Vector Machine (SVM) classifiers and trained each of them for a different rhetorical status on structured abstracts. A structured abstract is one that has labels indicating rhetorical statuses of the sentences, while an unstructured abstract does not. The classifiers were tested on both structured and unstructured abstracts. The former were randomly obtained from the MEDLINE database and the latter were manually labeled by humans. We compared our method with a previously reported one. In addition, we combined them and evaluated the combined method. Our method outperformed the previously reported one, and the combined method showed even better results. Classified abstracts can be used for multi-document summarization that provides researchers with a way of learning a research topic efficiently and effectively.
  • Keywords
    Abstracts; Computer science; Costs; Databases; History; Humans; Information retrieval; Protein engineering; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshops, 2005. 21st International Conference on
  • Print_ISBN
    0-7695-2657-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.170
  • Filename
    1647766