Title :
Matchmaking through semantic annotation and similarity measurement
Author :
Ensan, A. ; Biletskiy, Yevgen
Author_Institution :
Univ. of New Brunswick, Fredericton, NB, Canada
fDate :
April 29 2012-May 2 2012
Abstract :
The proposed work briefly describes an approach to automatically extract structured information from semi-structured documents to match the document creators and users in order to find the best similarities between them and connect them for further collaborations. The general idea is to employ a semantic annotation technique and similarity measurement approach by using the ontology to find best matches between web documents. The proposed approach uses ontologies to annotate the extracted information and for the measuring the similarity between each pair of documents. GATE (General Architecture for Text Engineering) as one of the most famous annotation tools has been utilized to annotate semi-structure documents. A novel algorithm is proposed to update the supported ontology for extraction purpose in GATE by using a training data set. Furthermore, specific domain-based metrics are also utilized to measure semantic similarities between documents with regard to semantic annotations which are implemented in an ontology-based approach. These metrics can be used in order to find the most similar web documents among documents corpus.
Keywords :
groupware; ontologies (artificial intelligence); text analysis; GATE; Web documents; collaborations; document creator-user matching; domain-based metrics; general architecture for text engineering; matchmaking; ontology-based approach; semantic annotation technique; semistructure document annotation; semistructured documents; similarity measurement approach; structured information extraction; training data set; Data mining; Information retrieval; Logic gates; Ontologies; Semantics; Vectors; XML; Computer Applications; Information Extraction; Matchmaking; Ontology; Semantic Annotation; Semantic Similarity;
Conference_Titel :
Electrical & Computer Engineering (CCECE), 2012 25th IEEE Canadian Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-1431-2
Electronic_ISBN :
0840-7789
DOI :
10.1109/CCECE.2012.6334966