Title :
Covariance estimation for classifying high dimensional data
Author :
Hoffbeck, Joseph P. ; Landgrebe, David A.
Author_Institution :
Sch. of Electr. Eng., Purdue Univ., West Lafayette, IN, USA
Abstract :
When classifying data with the Gaussian maximum likelihood classifier, the mean vector and covariance matrix of each class usually are not known and must be estimated from training samples. For p-dimensional data, the sample covariance estimate is singular, and therefore unusable, if fewer than p+1 training samples from each class are available, and it is a poor estimate of the true covariance unless many more than p+1 samples are available. Since inaccurate estimates of the covariance matrix lead to lowered classification accuracy and labeling training samples can be difficult and expensive in remote sensing applications, having too few training samples is a major impediment in using the Gaussian maximum likelihood classifier with high dimensional remote sensing data. In the paper, a new covariance estimator is presented that selects an appropriate mixture of the sample covariance and the common covariance estimates. The mixture deemed appropriate is the one that provides the best fit to the training samples in the sense that it maximizes the average likelihood of training samples not used in the estimates. When the number of training samples is limited or when the covariance matrices of the classes are similar, this estimator tends to select an estimate close to the common covariance, otherwise it favors the sample covariance estimate. Since it is non-singular whenever the common covariance estimate is non-singular, the new estimator can be used even when some of the sample covariance matrices are singular
Keywords :
covariance matrices; geophysical signal processing; image classification; image sampling; maximum likelihood estimation; remote sensing; Gaussian maximum likelihood classifier; classification accuracy; common covariance; covariance estimation; covariance matrix; high dimensional data; labeling; mean vector; remote sensing applications; sample covariance; training samples; Covariance matrix; H infinity control; Impedance; Labeling; Maximum likelihood estimation; Optical imaging; Parameter estimation; Remote sensing; Spectroscopy; Training data;
Conference_Titel :
Geoscience and Remote Sensing Symposium, 1995. IGARSS '95. 'Quantitative Remote Sensing for Science and Applications', International
Conference_Location :
Firenze
Print_ISBN :
0-7803-2567-2
DOI :
10.1109/IGARSS.1995.521127