• DocumentCode
    1253562
  • Title

    Shrinkage-to-Tapering Estimation of Large Covariance Matrices

  • Author

    Chen, Xiaohui ; Wang, Z. Jane ; McKeown, Martin J.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of British Columbia, Vancouver, BC, Canada
  • Volume
    60
  • Issue
    11
  • fYear
    2012
  • Firstpage
    5640
  • Lastpage
    5656
  • Abstract
    In this paper, we introduce a shrinkage-to-tapering approach for estimating large covariance matrices when the number of samples is substantially fewer than the number of variables (i.e., n,p→∞ and p/n→∞). The proposed estimator improves upon both shrinkage and tapering estimators by shrinking the sample covariance matrix to its tapered version. We first show that, under both normalized Frobenius and spectral risks, the minimum mean-squared error (MMSE) shrinkage-to-identity estimator is inconsistent and outperformed by a minimax tapering estimator for a class of high-dimensional and diagonally dominant covariance matrices. Motivated by this observation, we propose a shrinkage-to-tapering oracle (STO) estimator for efficient estimation of general, large covariance matrices. A closed-form formula of the optimal coefficient ρ of the proposed STO estimator is derived under the minimum Frobenius risk. Since the true covariance matrix is to be estimated, we further propose a STO approximating (STOA) algorithm with a data-driven bandwidth selection procedure to iteratively estimate the coefficient ρ and the covariance matrix. We study the finite sample performances of different estimators and our simulation results clearly show the improved performances of the proposed STO estimators. Finally, the proposed STOA method is applied to a real breast cancer gene expression data set.
  • Keywords
    approximation theory; covariance matrices; estimation theory; iterative methods; least mean squares methods; MMSE; STO approximating algorithm; STO estimator; STOA algorithm; breast cancer gene expression data set; closed-form formula; data-driven bandwidth selection; diagonally dominant covariance matrices; high-dimensional covariance matrices; large covariance matrices; minimax tapering estimator; minimum mean-squared error; normalized Frobenius risks; shrinkage-to-identity estimator; shrinkage-to-tapering estimation approach; spectral risks; Approximation algorithms; Bandwidth; Covariance matrix; Eigenvalues and eigenfunctions; Estimation; Mean square error methods; Signal processing algorithms; Large covariance estimation; minimax risk; minimum mean-squared errors; shrinkage estimator; tapering operator;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/TSP.2012.2210546
  • Filename
    6252067