DocumentCode
2977187
Title
Comparison of attribute selection techniques and algorithms in classifying bad behaviors of vocational education students
Author
Wongpun, Sukontip ; Srivihok, Anongnart
Author_Institution
Dept. of Comput. Sci., Kasetsart Univ., Bangkok
fYear
2008
fDate
26-29 Feb. 2008
Firstpage
526
Lastpage
531
Abstract
This study presents the comparison of attribute selection techniques which used for classifying the bad behaviors of vocational education students. There are two classification methods: hybrid classification and single classification. Hybrid classification includes two steps, step one is attribute selection by search method using genetic search and results are compared by three evaluators: 1) Correlation-based Feature Selection (CFS) 2) Consistency-based Subset Evaluation and 3) Wrapper Subset Evaluation. Step two is the classification of data set by using selected attributed from step one and four classification algorithms. Next, Simple classification used classification algorithms only without attribute selection. The four classification algorithms that used in this experiment for comparing in two methods are : 1) Naive Bayes classifier 2) Baysian Belief Network 3) C4.5 algorithm and 4) RIPPER algorithm. The measurements of classification efficiency had been obtained by using the k-fold cross validation technique. From the experiment, it was found that hybrid classification technique using genetic search and CFS evaluator with C4.5 algorithm, gives the highest accuracy rate at 82.52%. However, results from F-measure evaluation showed that C4.5 algorithm did not fit for all data types. The hybrid classification technique using genetic search and wrapper subset with Baysian belief network can give a better precision value which can be seen in the F-measure, and it gives the accuracy rate at 82.42%.
Keywords
Bayes methods; belief networks; data mining; educational administrative data processing; genetic algorithms; human factors; pattern classification; search problems; set theory; Baysian belief network; C4.5 algorithm; RIPPER algorithm; attribute selection technique; bad student behavior classification; data mining; genetic search method; hybrid classification technique; k-fold cross validation technique; naive Bayes classifier; single classification technique; vocational education student; wrapper subset; Classification algorithms; Computer science; Computer science education; Data analysis; Data mining; Ecosystems; Electronic mail; Genetics; Information analysis; Search methods; Attribute selection; Baysian belief network; C4.5 Algorithm; Naive Bayes classifier; RIPPER Algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Ecosystems and Technologies, 2008. DEST 2008. 2nd IEEE International Conference on
Conference_Location
Phitsanulok
Print_ISBN
978-1-4244-1489-5
Electronic_ISBN
978-1-4244-1490-1
Type
conf
DOI
10.1109/DEST.2008.4635213
Filename
4635213
Link To Document