• DocumentCode
    3399163
  • Title

    Building clusters with distributed features for text classification using KNN

  • Author

    Wajeed, Mohammed Abdul ; Adilakshmi, T.

  • Author_Institution
    SCSI, Sreenidhi Inst. of Sci. & Technol., Hyderabad, India
  • fYear
    2012
  • fDate
    10-12 Jan. 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Bulk data is generated in the era of Information Technology. If it is not stored in a properly systematic manner then the generated data cannot be reused. This is because navigation becomes if not impossible, certainly very difficult. So we classify the data before it is stored. Present paper explores the techniques to store the data in a supervised classification paradigm using distributed features. Initially Soft, hard and mixed Clusters are build based on the distributed features later the clusters are used to classify the documents based on the K-nearest neighbour classification algorithm.
  • Keywords
    learning (artificial intelligence); pattern classification; pattern clustering; text analysis; K-nearest neighbour classification algorithm; KNN; bulk data; distributed features; information technology; supervised classification paradigm; text classification; Accuracy; Equations; Informatics; Text categorization; Training; Training data; Vectors; distributed features; knn-classifier; soft-hard clusters; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communication and Informatics (ICCCI), 2012 International Conference on
  • Conference_Location
    Coimbatore
  • Print_ISBN
    978-1-4577-1580-8
  • Type

    conf

  • DOI
    10.1109/ICCCI.2012.6158839
  • Filename
    6158839