Title :
Semantic Labelling for Document Feature Patterns Using Ontological Subjects
Author :
Xiaohui Tao ; Yyuefeng Li ; Bin Liu ; Yan Shen
Author_Institution :
Centre for Syst. Biol., Univ. of Southern Queensland, Toowoomba, QLD, Australia
Abstract :
Finding and labelling semantic features patterns of documents in a large, spatial corpus is a challenging problem. Text documents have characteristics that make semantic labelling difficult, the rapidly increasing volume of online documents makes a bottleneck in finding meaningful textual patterns. Aiming to deal with these issues, we propose an unsupervised documnent labelling approach based on semantic content and feature patterns. A world ontology with extensive topic coverage is exploited to supply controlled, structured subjects for labelling. An algorithm is also introduced to reduce dimensionality based on the study of ontological structure. The proposed approach was promisingly evaluated by compared with typical machine learning methods including SVMs, Rocchio, and kNN.
Keywords :
ontologies (artificial intelligence); text analysis; unsupervised learning; word processing; data dimensionality reduction; document semantic feature pattern finding; online text document volume; semantic content; spatial corpus; textual patterns; topic coverage; unsupervised document semantic feature pattern labelling; world ontology; Ontology; Semantic labeling; feature; pattern;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
DOI :
10.1109/WI-IAT.2012.47