DocumentCode
3402381
Title
A Fuzzy-Based Approach for Text Representation in Text Categorization
Author
Doan, Son
Author_Institution
Japan Adv. Inst. of Sci. & Technol.
fYear
2005
fDate
25-25 May 2005
Firstpage
1008
Lastpage
1013
Abstract
Document representation is one of the most important tasks in text processing, especially in text categorization. This task has many applications that include document management, information retrieval, text routing, etc. In this paper, the author proposes a novel scheme for text representation based on fuzzy set theory. A new algorithm for choosing a term set that characterizes a document in the corpus is given under the view of fuzzy set. Experimental results applied to text categorization problem using the relevance feedback technique show that our proposed method reduced the number of dimensions and achieves higher performances compared to other baseline methods. In addition, it also produces results that compare favorably to the result achieved with the all vocabulary method
Keywords
classification; fuzzy set theory; relevance feedback; text analysis; vocabulary; document management; document representation; fuzzy set theory; fuzzy-based text representation; information retrieval; relevance feedback; text categorization; text processing; text routing; vocabulary; Feedback; Fuzzy set theory; Fuzzy sets; Indexing; Information management; Information retrieval; Routing; Text categorization; Text processing; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
Conference_Location
Reno, NV
Print_ISBN
0-7803-9159-4
Type
conf
DOI
10.1109/FUZZY.2005.1452532
Filename
1452532
Link To Document