• DocumentCode
    2331665
  • Title

    Improvement of Text Feature Selection Method Based on TFIDF

  • Author

    Qu, Shouning ; Wang, Sujuan ; Zou, Yan

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Univ. of Jinan, Jinan
  • fYear
    2008
  • fDate
    20-20 Nov. 2008
  • Firstpage
    79
  • Lastpage
    81
  • Abstract
    TFIDF is a kind of common methods used to select the text feature, but it has many disadvantages. First, the method undervalues that this term can represent the characteristic of the documents of this class if it only frequently appears in the documents belongs to the same class while infrequently in the documents of the other class. Second TFIDF neglects the relations between the feature and the class. The paper proposed the improved TFIDF strategy, and combined with the text classification method of simple distance vector to compare to traditional TFIDF, and obtained the very good classified effect, the experiment proved its feasibility.
  • Keywords
    pattern classification; text analysis; vectors; TFIDF; distance vector; text classification method; text feature selection method; Aggregates; Communications technology; Engineering management; Frequency; Information management; Information technology; Seminars; Technology management; Text categorization; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Information Technology and Management Engineering, 2008. FITME '08. International Seminar on
  • Conference_Location
    Leicestershire, United Kingdom
  • Print_ISBN
    978-0-7695-3480-0
  • Type

    conf

  • DOI
    10.1109/FITME.2008.25
  • Filename
    4746446