DocumentCode
1543204
Title
Thresholding method for dimensionality reduction in recognition systems
Author
Schmid, Natalia A. ; O´Sullivan, Joseph A.
Author_Institution
Electron. Syst. & Signals Res. Lab., Washington Univ., St. Louis, MO, USA
Volume
47
Issue
7
fYear
2001
fDate
11/1/2001 12:00:00 AM
Firstpage
2903
Lastpage
2920
Abstract
Often recognition systems must be designed with a relatively small amount of training data. Plug-in test statistics suffer from large estimation errors, often causing the performance to degrade as the measurement vector dimension increases. Choosing a better test statistic or applying a method of dimensionality reduction are two possible solutions to this problem. In this paper, we consider a recognition problem where the data for each population are assumed to have the same parametric distribution but differ in their unknown parameters. The collected vectors of data as well as their components are assumed to be independent. The system is designed to implement a plug-in log-likelihood ratio test with maximum-likelihood (ML) estimates of the unknown parameters instead of the true parameters. Because a small amount of data is available to estimate the parameters, the performance of such a system is strongly degraded relative to the performance with known parameters. To improve the performance of the system we define a thresholding function that, when incorporated into the plug-in log-likelihood ratio, significantly decreases the probability of error for binary and multiple hypothesis testing problems for the exponential class of populations. We analyze the modified test statistic and present the results of Monte Carlo simulation. Special attention is paid to the complex Gaussian model with zero mean and unknown variances
Keywords
Monte Carlo methods; entropy; error statistics; maximum likelihood estimation; pattern recognition; ML estimates; Monte Carlo simulation; binary hypothesis testing; complex Gaussian model; dimensionality reduction; error probability; exponential population class; large estimation errors; maximum-likelihood estimates; measurement vector dimension; modified test statistic; multiple hypothesis testing; parameter estimation; parametric distribution; pattern recognition systems; plug-in log-likelihood ratio test; plug-in test statistics; relative entropy; thresholding function; thresholding method; training data; zero mean model; Degradation; Error analysis; Estimation error; Maximum likelihood estimation; Parameter estimation; Parametric statistics; Statistical analysis; Statistical distributions; System testing; Training data;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/18.959269
Filename
959269
Link To Document