Title :
A Fuzzy-Based Approach for Text Representation in Text Categorization
Author_Institution :
Japan Adv. Inst. of Sci. & Technol.
Abstract :
Document representation is one of the most important tasks in text processing, especially in text categorization. This task has many applications that include document management, information retrieval, text routing, etc. In this paper, the author proposes a novel scheme for text representation based on fuzzy set theory. A new algorithm for choosing a term set that characterizes a document in the corpus is given under the view of fuzzy set. Experimental results applied to text categorization problem using the relevance feedback technique show that our proposed method reduced the number of dimensions and achieves higher performances compared to other baseline methods. In addition, it also produces results that compare favorably to the result achieved with the all vocabulary method
Keywords :
classification; fuzzy set theory; relevance feedback; text analysis; vocabulary; document management; document representation; fuzzy set theory; fuzzy-based text representation; information retrieval; relevance feedback; text categorization; text processing; text routing; vocabulary; Feedback; Fuzzy set theory; Fuzzy sets; Indexing; Information management; Information retrieval; Routing; Text categorization; Text processing; Vocabulary;
Conference_Titel :
Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
Conference_Location :
Reno, NV
Print_ISBN :
0-7803-9159-4
DOI :
10.1109/FUZZY.2005.1452532