Title :
Improving text categorization by resolving semantic ambiguity
Author :
Uejima, Hiroshi ; Miura, Tsuyoshi ; Shioya, Lsamu
Author_Institution :
Dept. of Elect. & Elect. Engr., Hosei Univ., Tokyo, Japan
Abstract :
In this investigation, we propose a new method for text categorization (TC) based on Bayesian approach by resolving ambiguity. The TC assumes weights to words of which meanings are ambiguous in a sense of synonymy and polysemy. We give weights to articles by examining dictionaries of thesaurus and of dimensionality reduction to improve the quality of TC. Also we show some experiments to illustrate how well our approach goes.
Keywords :
dictionaries; text analysis; thesauri; word processing; Bayesian approach; dimensionality reduction; polysemy; semantic ambiguity resolving; text categorization quality improvement; text mining; thesaurus dictionary; Bayesian methods; Data mining; Dictionaries; Humans; Informatics; Information retrieval; Routing; Stress; Text categorization; Thesauri;
Conference_Titel :
Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
Print_ISBN :
0-7803-7978-0
DOI :
10.1109/PACRIM.2003.1235901