DocumentCode :
2753079
Title :
Classification of Semantic Documents Based on WordNet
Author :
Shi, Bin ; Fang, Liying ; Yan, Jianzhuo ; Wang, Pu ; Dong, Chen
Author_Institution :
Coll. of Electron. Inf. & Control Eng., Beijing Univ. of Technol., Beijing, China
fYear :
2009
fDate :
5-6 Dec. 2009
Firstpage :
173
Lastpage :
176
Abstract :
There are a lot benefits to enable intelligent agent understanding the information from semantic Web. It enhances the efficiency of information usage and at the same time, suffices the need of users. Semantic documents contain adequate semantic information which helps understanding. However, discrepancy between ontology which is an interpreter of semantic document prevents the share of knowledge. In this paper, we proposed a uniform representation for the content, which include concepts and relations, of semantic documents based on WordNet. First, disambiguation is preceded within the key words in a document for the purpose of mapping them to concepts. Then we present the whole document in the form of concept graph that Levenshtein Distance could be applied for making a classification of documents. We have empirical result that this methodology makes a promising raise in accuracy.
Keywords :
graph theory; ontologies (artificial intelligence); semantic Web; word processing; Levenshtein distance; WordNet; concept graph; disambiguation; document classification; knowledge share; ontology; semantic Web; semantic documents; semantic information; Clustering methods; Educational institutions; Electronic government; Electronic learning; Frequency; Indexing; Information systems; Intelligent agent; Ontologies; Semantic Web; documents classification; word disambiguation; wordnet;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
E-Learning, E-Business, Enterprise Information Systems, and E-Government, 2009. EEEE '09. International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-0-7695-3907-2
Type :
conf
DOI :
10.1109/EEEE.2009.15
Filename :
5359268
Link To Document :
بازگشت