• DocumentCode
    1869240
  • Title

    Matchmaking through semantic annotation and similarity measurement

  • Author

    Ensan, A. ; Biletskiy, Yevgen

  • Author_Institution
    Univ. of New Brunswick, Fredericton, NB, Canada
  • fYear
    2012
  • fDate
    April 29 2012-May 2 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The proposed work briefly describes an approach to automatically extract structured information from semi-structured documents to match the document creators and users in order to find the best similarities between them and connect them for further collaborations. The general idea is to employ a semantic annotation technique and similarity measurement approach by using the ontology to find best matches between web documents. The proposed approach uses ontologies to annotate the extracted information and for the measuring the similarity between each pair of documents. GATE (General Architecture for Text Engineering) as one of the most famous annotation tools has been utilized to annotate semi-structure documents. A novel algorithm is proposed to update the supported ontology for extraction purpose in GATE by using a training data set. Furthermore, specific domain-based metrics are also utilized to measure semantic similarities between documents with regard to semantic annotations which are implemented in an ontology-based approach. These metrics can be used in order to find the most similar web documents among documents corpus.
  • Keywords
    groupware; ontologies (artificial intelligence); text analysis; GATE; Web documents; collaborations; document creator-user matching; domain-based metrics; general architecture for text engineering; matchmaking; ontology-based approach; semantic annotation technique; semistructure document annotation; semistructured documents; similarity measurement approach; structured information extraction; training data set; Data mining; Information retrieval; Logic gates; Ontologies; Semantics; Vectors; XML; Computer Applications; Information Extraction; Matchmaking; Ontology; Semantic Annotation; Semantic Similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical & Computer Engineering (CCECE), 2012 25th IEEE Canadian Conference on
  • Conference_Location
    Montreal, QC
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4673-1431-2
  • Electronic_ISBN
    0840-7789
  • Type

    conf

  • DOI
    10.1109/CCECE.2012.6334966
  • Filename
    6334966