DocumentCode :
2133179
Title :
Improving text categorization by resolving semantic ambiguity
Author :
Uejima, Hiroshi ; Miura, Tsuyoshi ; Shioya, Lsamu
Author_Institution :
Dept. of Elect. & Elect. Engr., Hosei Univ., Tokyo, Japan
Volume :
2
fYear :
2003
fDate :
28-30 Aug. 2003
Firstpage :
796
Abstract :
In this investigation, we propose a new method for text categorization (TC) based on Bayesian approach by resolving ambiguity. The TC assumes weights to words of which meanings are ambiguous in a sense of synonymy and polysemy. We give weights to articles by examining dictionaries of thesaurus and of dimensionality reduction to improve the quality of TC. Also we show some experiments to illustrate how well our approach goes.
Keywords :
dictionaries; text analysis; thesauri; word processing; Bayesian approach; dimensionality reduction; polysemy; semantic ambiguity resolving; text categorization quality improvement; text mining; thesaurus dictionary; Bayesian methods; Data mining; Dictionaries; Humans; Informatics; Information retrieval; Routing; Stress; Text categorization; Thesauri;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
Print_ISBN :
0-7803-7978-0
Type :
conf
DOI :
10.1109/PACRIM.2003.1235901
Filename :
1235901
Link To Document :
بازگشت