• DocumentCode
    3402381
  • Title

    A Fuzzy-Based Approach for Text Representation in Text Categorization

  • Author

    Doan, Son

  • Author_Institution
    Japan Adv. Inst. of Sci. & Technol.
  • fYear
    2005
  • fDate
    25-25 May 2005
  • Firstpage
    1008
  • Lastpage
    1013
  • Abstract
    Document representation is one of the most important tasks in text processing, especially in text categorization. It has many applications, including document management, information retrieval, and text routing. In this paper, the author proposes a novel scheme for text representation based on fuzzy set theory. A new algorithm is given that, from the viewpoint of fuzzy sets, chooses a term set characterizing a document in the corpus. Experimental results on a text categorization problem using the relevance feedback technique show that the proposed method reduces the number of dimensions and achieves higher performance than the other baseline methods. It also produces results that compare favorably with those achieved by the all-vocabulary method.
  • Keywords
    classification; fuzzy set theory; relevance feedback; text analysis; vocabulary; document management; document representation; fuzzy-based text representation; information retrieval; text categorization; text processing; text routing; Feedback; Fuzzy sets; Indexing; Information management; Routing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
  • Conference_Location
    Reno, NV
  • Print_ISBN
    0-7803-9159-4
  • Type

    conf

  • DOI
    10.1109/FUZZY.2005.1452532
  • Filename
    1452532