• DocumentCode
    827090
  • Title

    Dimensionality reduction in automatic knowledge acquisition: a simple greedy search approach

  • Author

    Huang, Samuel H.

  • Author_Institution
    Dept. of Mech., Ind., & Nucl. Eng., Cincinnati Univ., OH, USA
  • Volume
    15
  • Issue
    6
  • fYear
    2003
  • Firstpage
    1364
  • Lastpage
    1373
  • Abstract
    Knowledge acquisition is the process of collecting domain knowledge, documenting the knowledge, and transforming it into a computerized representation. Due to the difficulties involved in eliciting knowledge from human experts, knowledge acquisition was identified as a bottleneck in the development of knowledge-based system. Over the past decades, a number of automatic knowledge acquisition techniques have been developed. However, the performance of these techniques suffers from the so called curse of dimensionality, i.e., difficulties arise when many irrelevant (or redundant) parameters exist. This paper presents a heuristic approach based on statistics and greedy search for dimensionality reduction to facilitate automatic knowledge acquisition. The approach deals with classification problems. Specifically, Chi-square statistics are used to rank the importance of individual parameters. Then, a backward search procedure is employed to eliminate parameters (less important parameters first) that do not contribute to class separability. The algorithm is very efficient and was found to be effective when applied to a variety of problems with different characteristics.
  • Keywords
    heuristic programming; knowledge acquisition; pattern classification; search problems; statistics; Chi-square statistics; automatic knowledge acquisition; backward search procedure; class separability; classification; computerized representation; dimensionality reduction; greedy search; heuristic approach; knowledge-based system; parameter importance ranking; statistics; Artificial intelligence; Decision trees; Expert systems; Humans; Knowledge acquisition; Knowledge based systems; Knowledge engineering; Machine learning; Statistics; Testing;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2003.1245278
  • Filename
    1245278