• DocumentCode
    1944121
  • Title

    Profile Extraction from Mean Profile for Automatic Text Categorization

  • Author

    Lakshmi, K. ; Mukherjee, Saswati

  • Author_Institution
    Comput. Sci. & Eng. Dept., Anna Univ., Madras
  • Volume
    2
  • fYear
    2005
  • fDate
    28-30 Nov. 2005
  • Firstpage
    384
  • Lastpage
    389
  • Abstract
    With overwhelming growth of information technology, better organization of documents is required for easy access of information. Hence the need for text categorization becomes critical. Many researchers have turned their attention towards text categorization. Text categorization is the automated assignment of predefined categories to the text documents based on document contents. We proposed an automatic text categorization approach that uses profiles for categorization of text documents. A new similarity method has been used for measuring similarity between profiles and documents. We have explored different ways of profile creation and concluded that the increase in distance between profiles improves the classifier performance. Our classifier has an improved performance when compared with similar kind of text categorization methods
  • Keywords
    classification; learning (artificial intelligence); text analysis; automatic text categorization; information technology; pattern classification; profile extraction; similarity method; text document; Computer science; Data mining; Document handling; Educational institutions; Explosives; Information retrieval; Information technology; Internet; Natural languages; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence for Modelling, Control and Automation, 2005 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
  • Conference_Location
    Vienna
  • Print_ISBN
    0-7695-2504-0
  • Type

    conf

  • DOI
    10.1109/CIMCA.2005.1631499
  • Filename
    1631499