• DocumentCode
    3274304
  • Title

    Chinese readability assessment using TF-IDF and SVM

  • Author

    Chen, Yaw-Huei ; Tsai, Yi-Han ; Chen, Yu-Ta

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Chiayi Univ., Chiayi, Taiwan
  • Volume
    2
  • fYear
    2011
  • fDate
    10-13 July 2011
  • Firstpage
    705
  • Lastpage
    710
  • Abstract
    This paper proposes a simple yet effective method to automatically determine the readability of Chinese articles. We use mutual information to select the most important terms from the training data, calculate TF-IDF values based on those terms, and use those values as features for SVM to build classification models that identify articles suitable for lower grade students and middle grade students in elementary school. The experiments on elementary school textbooks produce satisfactory results.
  • Keywords
    classification; educational technology; student experiments; support vector machines; Chinese readability assessment; SVM; TF-IDF; classification models; elementary school textbooks; lower grade students; middle grade students; training data; Educational institutions; Machine learning; Mutual information; Support vector machines; Testing; Training; Training data; Classification; Mutual Information; Readability; SVM; TF-IDF;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
  • Conference_Location
    Guilin
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4577-0305-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2011.6016783
  • Filename
    6016783