DocumentCode
1662602
Title
When You Doubt, Abstain: From Misclassification to Epoché in Automatic Text Categorisation
Author
Locoro, Angela ; Grignani, Daniele ; Mascardi, Viviana
Author_Institution
Comput. Sci. Dept., Univ. of Genova, Genova, Italy
Volume
3
fYear
2011
Firstpage
209
Lastpage
212
Abstract
This paper describes how natural language processing and ontologies are exploited for automatic text categorisation. The approach introduced is part of the MANENT system, an infrastructure for integrating, structuring and searching Digital Libraries. The procedure of structural information extraction, and of the automatic classification of the records according to natural language understanding and the WordNet Domains taxonomy is discussed. A comparison between two versions of the classification algorithm is conducted and the improvements of the new approach are articulated. In particular, using semantic connections between words refines the classification results while reducing misclassification to no classification.
Keywords
classification; digital libraries; information retrieval; natural language processing; ontologies (artificial intelligence); text analysis; MANENT system; WordNet domain taxonomy; automatic text categorisation; digital libraries; natural language processing; natural language understanding; ontologies; record automatic classification; structural information extraction; Educational institutions; Frequency domain analysis; Humans; Libraries; Ontologies; Semantics; Tagging; automatic text categorisation; natural language processing; semantic digital libraries; wordnet domains;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location
Lyon
Print_ISBN
978-1-4577-1373-6
Electronic_ISBN
978-0-7695-4513-4
Type
conf
DOI
10.1109/WI-IAT.2011.65
Filename
6040842
Link To Document