• DocumentCode
    677996
  • Title

    MapReduce Based Method for Big Data Semantic Clustering

  • Author

    Jie Yang ; Xiaoping Li

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
  • fYear
    2013
  • fDate
    13-16 Oct. 2013
  • Firstpage
    2814
  • Lastpage
    2819
  • Abstract
    Big data analysis is very hot in cloud computing environments. How to automatically map heterogeneous data with the same semantics is one of the key problems in big data analysis. A big data clustering method based on the MapReduce framework is proposed in this paper. Big data are decomposed into many data chunks for parallel clustering, which is implemented by Ant Colony. Data elements are moved and clustered by ants according to the presented criterion. The proposed method is compared with the MapReduce framework based k-means clustering algorithm on a great amount of practical data. Experimental results show that the proposal is much effective for big data clustering.
  • Keywords
    cloud computing; data handling; optimisation; pattern clustering; MapReduce based method; ant colony; big data analysis; big data semantic clustering; cloud computing environments; data chunks; k-means clustering algorithm; parallel clustering; Accuracy; Algorithm design and analysis; Clustering algorithms; Data handling; Data storage systems; Information management; Semantics; Ant colony; MapReduce; big data; cloud computing; k-means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
  • Conference_Location
    Manchester
  • Type

    conf

  • DOI
    10.1109/SMC.2013.480
  • Filename
    6722233