• DocumentCode
    153102
  • Title

    Çok boyutlu veriden rastgele ormanlar ile düşük karmaşıklıklı güdümsüz öğrenme

  • Author

    Aydin, Ali Selman ; Arici, Tank ; Bulut, Ahmet

  • fYear
    2014
  • fDate
    23-25 April 2014
  • Firstpage
    2229
  • Lastpage
    2232
  • Abstract
    With the ever increasing rate of digital information available from online sources, information has gone from being scarce to being abundant. Big data analytics require low complexity and distributed computing techniques. We propose the use of randomized decision trees and their ensemble in the form of a forest for unsupervised learning. Random probing of good attributes reduces the computational complexity making learning feasible on high-dimensional big data. Using an ensemble of trees improves the learning. We propose a new splitting measure for tree construction and an aggregation mechanism for predictive learning (unsupervised classification). The experiments on standard datasets show that our proposed proposed method outperforms the state-of-the-art.
  • Keywords
    Big Data; data analysis; decision trees; pattern classification; unsupervised learning; Big Data analytics; aggregation mechanism; computational complexity; digital information; distributed computing techniques; ensemble learning; high-dimensional Big Data; multidimensional data; online sources; predictive learning; random forests; randomized decision trees; splitting measure; tree construction; unsupervised classification; unsupervised learning; Complexity theory; Conferences; Decision trees; Histograms; Signal processing; Vegetation; Random forests; clustering; decision trees; unsupervised learnings;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference (SIU), 2014 22nd
  • Conference_Location
    Trabzon
  • Type

    conf

  • DOI
    10.1109/SIU.2014.6830708
  • Filename
    6830708