• DocumentCode
    721162
  • Title

    Performance evaluation of categorizing technical support requests using advanced K-Means algorithm

  • Author

    Nadaf, Mubina A. ; Patil, Sachin S.

  • Author_Institution
    Dept. of CSE, Rajarambapu Inst. of Technol., Islampur, India
  • fYear
    2015
  • fDate
    12-13 June 2015
  • Firstpage
    409
  • Lastpage
    414
  • Abstract
    Technical support service providers receive thousands of customer queries daily. Traditionally, such organizations discard the data due to lack of storage capacity. However, value of storing such data is needed for the better results of analysis and to improve the closure rate of the daily customer queries. Data mining is the process of finding important and meaningful information, patterns through the large amount of data. Clustering is used as one of the best concept for data analysis, using machine learning approach with mathematical and statistical methods. Cluster analysis is widely applicable for practical applications in emerging trends in data mining. Analysis of clustering algorithms such as K-Means, Dirichlet, Fuzzy K-Means Canopy algorithms is done by means of the practical approach, in this research work. Performance of algorithm is observed based on the execution or computational time and results are compared with each of these algorithms. This paper proposes the streaming K-Means algorithm which resolves the queries as it arrives and analyses the data. Cosine distance measure plays an important role in clustering dataset. Sum of Square error is measured to check the quality of the cluster.
  • Keywords
    data analysis; data mining; learning (artificial intelligence); mathematical analysis; pattern clustering; query processing; statistical analysis; technical support services; Dirichlet; cluster analysis; clustering algorithms; cosine distance measure; customer queries; data analysis; data mining; data storing; fuzzy k-means canopy algorithms; machine learning approach; mathematical methods; performance evaluation; statistical methods; storage capacity; sum of square error; technical support requests; technical support service providers; Algorithm design and analysis; Clustering algorithms; Data mining; File systems; Machine learning algorithms; Partitioning algorithms; Sparks; Data mining; Machine Learning; MapReduce; Streaming K-Means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advance Computing Conference (IACC), 2015 IEEE International
  • Conference_Location
    Banglore
  • Print_ISBN
    978-1-4799-8046-8
  • Type

    conf

  • DOI
    10.1109/IADCC.2015.7154740
  • Filename
    7154740