• DocumentCode
    3579042
  • Title

    Scalable parallel clustering approach for large data using genetic possibilistic fuzzy c-means algorithm

  • Author

    Mathew, Juby ; Vijayakumar, R.

  • Author_Institution
    Dept. of MCA, AmalJyothi College of Engg., Kanjirapally, Kerala, India
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In various domains, big data play crucial and related processes because of the latest developments in the digital planet. Such irrepressible data growth has led to bring clustering algorithms to segment the data into small sets to perform associated processes with them. However, the challenge continues in dealing with large data, because most of the algorithms are compatible only with small data. However, the existing clustering algorithms either handle different data types with inefficiency in handling large data or handle large data with limitations in considering numeric attributes. Hence, parallel clustering has come into the picture to provide crucial contribution towards clustering large data. This insists the need of having scalable parallel clustering to solve the aforesaid problems. In this paper, we have developed a scalable parallel clustering algorithm called Possibilistic Fuzzy C-Means (PFCM) clustering to cluster large data. So, our ultimate aim is to design and develop an algorithm in parallel way by considering data. The parallel architecture includes, splitting the input data and clustering each set of data using PFCM. Then the genetic firefly algorithm applied to the merged cluster data, which will provide better clustering accuracy in merge data. The experimental analysis will be carried out to evaluate the feasibility of the scalable Possibilistic Fuzzy C-Means (PFCM) clustering approach. The experimental analysis showed that the proposed approach obtained upper head over existing method in terms of accuracy and time.
  • Keywords
    Accuracy; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data mining; Genetics; Indexes; FCM; Firefly algorithm; PFCM; genetic algorithm; large data; parallel clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on
  • Print_ISBN
    978-1-4799-3974-9
  • Type

    conf

  • DOI
    10.1109/ICCIC.2014.7238335
  • Filename
    7238335