• DocumentCode
    260197
  • Title

    An optimized approach for unbalanced big data categorizing using fuzzy clustering

  • Author

    Mehneh, Saman Fallah ; Toosi, JalilGazalan ; Jalali, Mehrdad

  • Author_Institution
    Dept. of Software Eng., Islamic Azad Univ. Mashhad, Mashhad, Iran
  • fYear
    2014
  • fDate
    26-27 Nov. 2014
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Big data is a set of very large and complex data that is hard to load on computers. The main challenge in big data world is related to their search, categorize and analyze specially, when they are unbalanced. Despite, there are a lot of works in the field of big data but analyzing unbalanced big data is still a fundamental challenge in this area. In this paper we try to solve the problem of RSIO-LFCM method in face with unbalanced data and in training phase, we increase its accuracy in order to identify classes with low frequency of samples. Our proposed method starts with adding a little change in the initial phase of the algorithm. Then we add a phase in order to balance samples frequency to resolve RSIO-LFCM problems. The results show that in compare with RSIO-LFCM method, our proposed method has better accuracy in identifying super clusters and its corresponding super classes and also in identifying small clusters and classes.
  • Keywords
    Big Data; fuzzy set theory; pattern clustering; Big Data categorizatoin; RSIO-LFCM method; fuzzy clustering; optimized approach; Accuracy; Big data; Clustering algorithms; Computers; Educational institutions; Software engineering; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technology, Communication and Knowledge (ICTCK), 2014 International Congress on
  • Conference_Location
    Mashhad
  • Type

    conf

  • DOI
    10.1109/ICTCK.2014.7033504
  • Filename
    7033504