• DocumentCode
    1543204
  • Title

    Thresholding method for dimensionality reduction in recognition systems

  • Author

    Schmid, Natalia A. ; O´Sullivan, Joseph A.

  • Author_Institution
    Electron. Syst. & Signals Res. Lab., Washington Univ., St. Louis, MO, USA
  • Volume
    47
  • Issue
    7
  • fYear
    2001
  • fDate
    11/1/2001 12:00:00 AM
  • Firstpage
    2903
  • Lastpage
    2920
  • Abstract
    Often recognition systems must be designed with a relatively small amount of training data. Plug-in test statistics suffer from large estimation errors, often causing the performance to degrade as the measurement vector dimension increases. Choosing a better test statistic or applying a method of dimensionality reduction are two possible solutions to this problem. In this paper, we consider a recognition problem where the data for each population are assumed to have the same parametric distribution but differ in their unknown parameters. The collected vectors of data as well as their components are assumed to be independent. The system is designed to implement a plug-in log-likelihood ratio test with maximum-likelihood (ML) estimates of the unknown parameters instead of the true parameters. Because a small amount of data is available to estimate the parameters, the performance of such a system is strongly degraded relative to the performance with known parameters. To improve the performance of the system we define a thresholding function that, when incorporated into the plug-in log-likelihood ratio, significantly decreases the probability of error for binary and multiple hypothesis testing problems for the exponential class of populations. We analyze the modified test statistic and present the results of Monte Carlo simulation. Special attention is paid to the complex Gaussian model with zero mean and unknown variances
  • Keywords
    Monte Carlo methods; entropy; error statistics; maximum likelihood estimation; pattern recognition; ML estimates; Monte Carlo simulation; binary hypothesis testing; complex Gaussian model; dimensionality reduction; error probability; exponential population class; large estimation errors; maximum-likelihood estimates; measurement vector dimension; modified test statistic; multiple hypothesis testing; parameter estimation; parametric distribution; pattern recognition systems; plug-in log-likelihood ratio test; plug-in test statistics; relative entropy; thresholding function; thresholding method; training data; zero mean model; Degradation; Error analysis; Estimation error; Maximum likelihood estimation; Parameter estimation; Parametric statistics; Statistical analysis; Statistical distributions; System testing; Training data;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/18.959269
  • Filename
    959269