DocumentCode :
3402381
Title :
A Fuzzy-Based Approach for Text Representation in Text Categorization
Author :
Doan, Son
Author_Institution :
Japan Adv. Inst. of Sci. & Technol.
fYear :
2005
fDate :
25-25 May 2005
Firstpage :
1008
Lastpage :
1013
Abstract :
Document representation is one of the most important tasks in text processing, especially in text categorization. This task has many applications that include document management, information retrieval, text routing, etc. In this paper, the author proposes a novel scheme for text representation based on fuzzy set theory. A new algorithm for choosing a term set that characterizes a document in the corpus is given under the view of fuzzy set. Experimental results applied to text categorization problem using the relevance feedback technique show that our proposed method reduced the number of dimensions and achieves higher performances compared to other baseline methods. In addition, it also produces results that compare favorably to the result achieved with the all vocabulary method
Keywords :
classification; fuzzy set theory; relevance feedback; text analysis; vocabulary; document management; document representation; fuzzy set theory; fuzzy-based text representation; information retrieval; relevance feedback; text categorization; text processing; text routing; vocabulary; Feedback; Fuzzy set theory; Fuzzy sets; Indexing; Information management; Information retrieval; Routing; Text categorization; Text processing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
Conference_Location :
Reno, NV
Print_ISBN :
0-7803-9159-4
Type :
conf
DOI :
10.1109/FUZZY.2005.1452532
Filename :
1452532
Link To Document :
بازگشت