DocumentCode :
827090
Title :
Dimensionality reduction in automatic knowledge acquisition: a simple greedy search approach
Author :
Huang, Samuel H.
Author_Institution :
Dept. of Mech., Ind., & Nucl. Eng., Cincinnati Univ., OH, USA
Volume :
15
Issue :
6
fYear :
2003
Firstpage :
1364
Lastpage :
1373
Abstract :
Knowledge acquisition is the process of collecting domain knowledge, documenting the knowledge, and transforming it into a computerized representation. Due to the difficulties involved in eliciting knowledge from human experts, knowledge acquisition was identified as a bottleneck in the development of knowledge-based system. Over the past decades, a number of automatic knowledge acquisition techniques have been developed. However, the performance of these techniques suffers from the so called curse of dimensionality, i.e., difficulties arise when many irrelevant (or redundant) parameters exist. This paper presents a heuristic approach based on statistics and greedy search for dimensionality reduction to facilitate automatic knowledge acquisition. The approach deals with classification problems. Specifically, Chi-square statistics are used to rank the importance of individual parameters. Then, a backward search procedure is employed to eliminate parameters (less important parameters first) that do not contribute to class separability. The algorithm is very efficient and was found to be effective when applied to a variety of problems with different characteristics.
Keywords :
heuristic programming; knowledge acquisition; pattern classification; search problems; statistics; Chi-square statistics; automatic knowledge acquisition; backward search procedure; class separability; classification; computerized representation; dimensionality reduction; greedy search; heuristic approach; knowledge-based system; parameter importance ranking; statistics; Artificial intelligence; Decision trees; Expert systems; Humans; Knowledge acquisition; Knowledge based systems; Knowledge engineering; Machine learning; Statistics; Testing;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2003.1245278
Filename :
1245278
Link To Document :
بازگشت