• DocumentCode
    448868
  • Title

    Benchmarking of semantic annotation with conditional random fields

  • Author

    Grilheres, Bruno ; Beauce, Christophe ; Canu, Stephane ; Brunessaux, Stephan

  • Author_Institution
    EADS DCS, CHRIS, Val de Reuil, France
  • fYear
    2005
  • fDate
    Nov. 30 2005-Dec. 1 2005
  • Firstpage
    233
  • Lastpage
    236
  • Abstract
    The Semantic Web requires document annotation with various meta-data. But for end-users, doing it manually would be extremely time consuming and unfeasible for billion of documents. To reduce this burden, Information Extraction techniques should be applied. This paper describes the use of a recent probabilistic sequence model, Conditional Random Fields, to annotate semi-automatically sets of documents. It introduces the model principles and how to configure it to maximise the detection capabilities. The approach is evaluated on a task of event detection in news press articles related to terrorism events (the MUC-LAT corpus).
  • Keywords
    document handling; information analysis; information retrieval; meta data; random processes; semantic Web; conditional random fields; detection capability; document annotation; event detection; information extraction; meta-data; news press article; probabilistic sequence model; semantic Web; semantic annotation; terrorism event;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Integration of Knowledge, Semantics and Digital Media Technology, 2005. EWIMT 2005. The 2nd European Workshop on the (Ref. No. 2005/11099)
  • Conference_Location
    London
  • ISSN
    0537-9989
  • Print_ISBN
    0-86341-595-4
  • Type

    conf

  • Filename
    1575989