• DocumentCode
    3542976
  • Title

    Evaluation of named entity recognition tools on microposts

  • Author

    Dlugolinsky, Stefan ; Ciglan, Marek ; Laclavik, Michal

  • Author_Institution
    Inst. of Inf., Bratislava, Slovakia
  • fYear
    2013
  • fDate
    19-21 June 2013
  • Firstpage
    197
  • Lastpage
    202
  • Abstract
    In this paper we evaluate eight well-known Information Extraction (IE) tools on a task of Named Entity Recognition (NER) in microposts. We have chosen six NLP tools and two Wikipedia concept extractors for the evaluation. Our intent was to see how these tools would perform on relatively short texts of microposts. Evaluation dataset has been adopted from the MSM 2013 IE Challenge. This dataset contained manually annotated microposts with classification restricted to four entity types: PER, LOC, ORG and MISC.
  • Keywords
    Web sites; natural language processing; text analysis; IE tools; LOC; MISC; MSM 2013 IE Challenge; NER; NLP tools; ORG; PER; Wikipedia concept extractors; evaluation dataset; information extraction tools; manually annotated microposts; named entity recognition tools evaluation; Electronic publishing; Encyclopedias; Feature extraction; Internet; Logic gates; Organizations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Engineering Systems (INES), 2013 IEEE 17th International Conference on
  • Conference_Location
    San Jose
  • Print_ISBN
    978-1-4799-0828-8
  • Type

    conf

  • DOI
    10.1109/INES.2013.6632810
  • Filename
    6632810