Title :
Çok boyutlu veriden rastgele ormanlar ile düşük karmaşıklıklı güdümsüz öğrenme
Author :
Aydin, Ali Selman ; Arici, Tank ; Bulut, Ahmet
Abstract :
With the ever increasing rate of digital information available from online sources, information has gone from being scarce to being abundant. Big data analytics require low complexity and distributed computing techniques. We propose the use of randomized decision trees and their ensemble in the form of a forest for unsupervised learning. Random probing of good attributes reduces the computational complexity making learning feasible on high-dimensional big data. Using an ensemble of trees improves the learning. We propose a new splitting measure for tree construction and an aggregation mechanism for predictive learning (unsupervised classification). The experiments on standard datasets show that our proposed proposed method outperforms the state-of-the-art.
Keywords :
Big Data; data analysis; decision trees; pattern classification; unsupervised learning; Big Data analytics; aggregation mechanism; computational complexity; digital information; distributed computing techniques; ensemble learning; high-dimensional Big Data; multidimensional data; online sources; predictive learning; random forests; randomized decision trees; splitting measure; tree construction; unsupervised classification; unsupervised learning; Complexity theory; Conferences; Decision trees; Histograms; Signal processing; Vegetation; Random forests; clustering; decision trees; unsupervised learnings;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location :
Trabzon
DOI :
10.1109/SIU.2014.6830708