DocumentCode :
3123220
Title :
Corrupted and missing predictors: Minimax bounds for high-dimensional linear regression
Author :
Loh, Po-Ling ; Wainwright, Martin J.
Author_Institution :
Dept. of Stat., Univ. of California, Berkeley, Berkeley, CA, USA
fYear :
2012
fDate :
1-6 July 2012
Firstpage :
2601
Lastpage :
2605
Abstract :
Missing and corrupted data are ubiquitous in many science and engineering domains. We analyze the information-theoretic limits of recovering sparse vectors under various models of corrupted and missing data. In particular, consider a high-dimensional linear regression model y = X β* + ϵ, where y ∈ Rn is the response vector, X ∈ RnXp is a random design matrix with p ≫ n and rows distributed i.i.d. as N(0, Σx), β* ∈ Rp is the unknown regression vector, and ϵ ~ N(0,σϵ2I) is independent additive noise. Whereas a traditional approach assumes that the covariates X are fully observed, we assume only that a corrupted version Z is observed. Our main contribution is to establish minimax rates of convergence for estimating β* in squared ℓ2-loss, assuming β* is k-sparse. Our upper and lower bounds in both additive noise and missing data cases scale as k log(p/k)/n, with prefactors depending only on the corruption and/or missing pattern of the data.
Keywords :
information theory; matrix algebra; regression analysis; high-dimensional linear regression model; independent additive noise; information-theoretic analysis; lower bounds; minimax bounds; random design matrix; sparse vector recovery; squared ℓ2-loss; upper bounds; Additive noise; Equations; Estimation; Linear regression; Standards; Upper bound; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory Proceedings (ISIT), 2012 IEEE International Symposium on
Conference_Location :
Cambridge, MA
ISSN :
2157-8095
Print_ISBN :
978-1-4673-2580-6
Electronic_ISBN :
2157-8095
Type :
conf
DOI :
10.1109/ISIT.2012.6283989
Filename :
6283989
Link To Document :
بازگشت