DocumentCode
3274304
Title
Chinese readability assessment using TF-IDF and SVM
Author
Chen, Yaw-Huei ; Tsai, Yi-Han ; Chen, Yu-Ta
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiayi Univ., Chiayi, Taiwan
Volume
2
fYear
2011
fDate
10-13 July 2011
Firstpage
705
Lastpage
710
Abstract
This paper proposes a simple yet effective method to automatically determine the readability of Chinese articles. We use mutual information to select the most important terms from the training data, calculate TF-IDF values based on those terms, and use those values as features for SVM to build classification models that identify articles suitable for lower grade students and middle grade students in elementary school. The experiments on elementary school textbooks produce satisfactory results.
Keywords
classification; educational technology; student experiments; support vector machines; Chinese readability assessment; SVM; TF-IDF; classification models; elementary school textbooks; lower grade students; middle grade students; training data; Educational institutions; Machine learning; Mutual information; Support vector machines; Testing; Training; Training data; Classification; Mutual Information; Readability; SVM; TF-IDF;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location
Guilin
ISSN
2160-133X
Print_ISBN
978-1-4577-0305-8
Type
conf
DOI
10.1109/ICMLC.2011.6016783
Filename
6016783
Link To Document