• DocumentCode
    2349865
  • Title

    Result identification for biomedical abstracts using Conditional Random Fields

  • Author

    Lin, Ryan T.K. ; Dai, Hong-Jei ; Bow, Yue-Yang ; Day, Min-Yuh ; Tzong-Han Tsai, Richard ; Hsu, Wen-Lian

  • Author_Institution
    Institute of Information Science, Academia Sinica, Taipei, China
  • fYear
    2008
  • fDate
    13-15 July 2008
  • Firstpage
    122
  • Lastpage
    126
  • Abstract
    For biomedical research, the most important parts of an abstract are the result and conclusion sections. Some journals divide an abstract into several sections so that readers can easily identify those parts, but others do not. We propose a method that can automatically identify the result and conclusion sections of any biomedical abstracts by formulating this identification problem as a sequence labeling task. Three feature sets (Position, Named Entity, and Word Frequency) are employed with Conditional Random Fields (CRFs) as the underlying machine learning model. Experimental results show that the combination of our proposed feature sets can achieve F-measure, precision, and recall scores of 92.50%, 95.32% and 89.85%, respectively.
  • Keywords
    Abstracts; Computer science; Frequency; Hidden Markov models; Hypertension; Information science; Machine learning; Support vector machine classification; Support vector machines; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2008. IRI 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV, USA
  • Print_ISBN
    978-1-4244-2659-1
  • Electronic_ISBN
    978-1-4244-2660-7
  • Type

    conf

  • DOI
    10.1109/IRI.2008.4583016
  • Filename
    4583016