DocumentCode
1869240
Title
Matchmaking through semantic annotation and similarity measurement
Author
Ensan, A. ; Biletskiy, Yevgen
Author_Institution
Univ. of New Brunswick, Fredericton, NB, Canada
fYear
2012
fDate
April 29 2012-May 2 2012
Firstpage
1
Lastpage
5
Abstract
The proposed work briefly describes an approach to automatically extract structured information from semi-structured documents to match the document creators and users in order to find the best similarities between them and connect them for further collaborations. The general idea is to employ a semantic annotation technique and similarity measurement approach by using the ontology to find best matches between web documents. The proposed approach uses ontologies to annotate the extracted information and for the measuring the similarity between each pair of documents. GATE (General Architecture for Text Engineering) as one of the most famous annotation tools has been utilized to annotate semi-structure documents. A novel algorithm is proposed to update the supported ontology for extraction purpose in GATE by using a training data set. Furthermore, specific domain-based metrics are also utilized to measure semantic similarities between documents with regard to semantic annotations which are implemented in an ontology-based approach. These metrics can be used in order to find the most similar web documents among documents corpus.
Keywords
groupware; ontologies (artificial intelligence); text analysis; GATE; Web documents; collaborations; document creator-user matching; domain-based metrics; general architecture for text engineering; matchmaking; ontology-based approach; semantic annotation technique; semistructure document annotation; semistructured documents; similarity measurement approach; structured information extraction; training data set; Data mining; Information retrieval; Logic gates; Ontologies; Semantics; Vectors; XML; Computer Applications; Information Extraction; Matchmaking; Ontology; Semantic Annotation; Semantic Similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical & Computer Engineering (CCECE), 2012 25th IEEE Canadian Conference on
Conference_Location
Montreal, QC
ISSN
0840-7789
Print_ISBN
978-1-4673-1431-2
Electronic_ISBN
0840-7789
Type
conf
DOI
10.1109/CCECE.2012.6334966
Filename
6334966
Link To Document