DocumentCode
249387
Title
Towards Big Data Bayesian Network Learning - An Ensemble Learning Based Approach
Author
Yan Tang ; Yu Wang ; Cooper, Kendra M. L. ; Ling Li
Author_Institution
Coll. of Comput. & Inf, Hohai Univ., Nanjing, China
fYear
2014
fDate
June 27 2014-July 2 2014
Firstpage
355
Lastpage
357
Abstract
Recently, we are entering the Big Data era[[1]]. The Bayesian Network (BN), as a directed probabilistic graph model, is providing intuitive knowledge presentation and accurate prediction for many mission critical areas. However, the current algorithms do not scale well for Big Data Bayesian network learning. This paper proposes a novel parallel BN learning algorithm called PENBays (Parallel ENsemble based Bayesian Networks Learning), which integrates the best BN learning algorithms MMHC, TPDA and REC. It has three phases: Data Preprocess (DP), Individual Ensemble Learning (IEL) and Central Ensemble Learning (CNL). Through these phases, PENBays effectively learns a BN rapidly from large datasets. Experiments reveal that PENBays learns BNs with better accuracy than base line learning algorithms like MMHC, TPDA and REC, showing promising application potential in the big data mining area.
Keywords
Bayes methods; Big Data; data mining; directed graphs; learning (artificial intelligence); parallel algorithms; Big Data Bayesian network learning; Big Data mining; CNL; DP; IEL; MMHC; PENBays; REC; TPDA; central ensemble learning; data preprocess; directed probabilistic graph model; ensemble learning based approach; individual ensemble learning; knowledge presentation; parallel BN learning algorithm; parallel ensemble based Bayesian network learning; Algorithm design and analysis; Bayes methods; Big data; Computational modeling; Data models; IEL; Prediction algorithms; Bayesian network; Big Data; Distributed computing; Ensemble learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location
Anchorage, AK
Print_ISBN
978-1-4799-5056-0
Type
conf
DOI
10.1109/BigData.Congress.2014.58
Filename
6906801
Link To Document