DocumentCode
3717123
Title
Towards scalable quantile regression trees
Author
Harish S. Bhat;Nitesh Kumar;Garnet J. Vaz
Author_Institution
Applied Mathematics Unit, UC Merced, Merced, USA
fYear
2015
Firstpage
53
Lastpage
60
Abstract
We provide an algorithm to build quantile regression trees in O(N log N) time, where N is the number of instances in the training set. Quantile regression trees are regression trees that model conditional quantiles of the response variable, rather than the conditional expectation as in standard regression trees. We build quantile regression trees by using the quantile loss function in our node splitting criterion. The performance of our algorithm stems from new online update procedures for both the quantile function and the quantile loss function. We test the quantile tree algorithm in three ways, comparing its running time against implementations of standard regression trees, demonstrating its ability to recover a known set of nonlinear quantile functions, and showing that quantile trees yield smaller test set errors (computed using mean absolute deviation) than standard regression trees. The tests include training sets with up to 16 million instances. Overall, our results enable future use of quantile regression trees for large-scale data mining.
Keywords
"Regression tree analysis","Vegetation","Prediction algorithms","Yttrium","Standards","Approximation methods","Training"
Publisher
ieee
Conference_Titel
Big Data (Big Data), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/BigData.2015.7363741
Filename
7363741
Link To Document