Title :
Near-Optimal Coresets for Least-Squares Regression
Author :
Boutsidis, Christos ; Drineas, Petros ; Magdon-Ismail, Malik
Author_Institution :
Dept. of Math. Sci., IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We study the (constrained) least-squares regression as well as multiple response least-squares regression and ask the question of whether a subset of the data, a coreset, suffices to compute a good approximate solution to the regression. We give deterministic, low-order polynomial-time algorithms to construct such coresets with approximation guarantees, together with lower bounds indicating that there is not much room for improvement upon our results.
Keywords :
least squares approximations; regression analysis; approximate solution; least squares regression; near optimal coresets; polynomial time algorithms; Approximation algorithms; Approximation methods; Distributed databases; Electronic mail; Linear regression; Time series analysis; Vectors; Least mean square algorithms; machine learning algorithms; regression analysis;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2013.2272457