• DocumentCode
    3317204
  • Title

    Named Entity Recognition from Biomedical Text Using SVM

  • Author

    Ju, Zhenfei ; Jian Wang ; Zhu, Fei

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • fYear
    2011
  • fDate
    10-12 May 2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Nowadays biomedical research is developing rapidly. A large number of biomedical knowledge exists in the form of unstructured text documents in various files. Named Entity Recognition (NER) from biomedical text is one of the basic task s of biomedical text mining, of which purpose is to recognize the name of the specified type from the collection of biomedical text. NER result is usually the processing object of other text mining. NER from biological text is the foundation of bioinformatics research. At present, the best f-measure of biological named entity recognition system has reached more than 80%, but is lower than general NER system which can reach about 90%. Here we use support vector machine (SVM), which is an effective and efficient tool to analyze data and recognize patterns, to recognize biomedical named entity. We get data set from GENIA corpus which is a collection of Medline abstracts. In the experiment, we get precision rate= 84.24% and recall rate=80.76% finally.
  • Keywords
    bioinformatics; data analysis; data mining; document handling; pattern recognition; support vector machines; GENIA corpus; Medline abstract; NER system; SVM; bioinformatics research; biomedical knowledge; biomedical research; biomedical text mining; data analysis; named entity recognition; pattern recognition; support vector machine; unstructured text document; Biological information theory; Conferences; Data mining; Hidden Markov models; Support vector machines; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering, (iCBBE) 2011 5th International Conference on
  • Conference_Location
    Wuhan
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-5088-6
  • Type

    conf

  • DOI
    10.1109/icbbe.2011.5779984
  • Filename
    5779984