• DocumentCode
    168355
  • Title

    Towards automatic identification of core concepts in educational resources

  • Author

    Sultan, M. Arafat ; Bethard, Steven ; Sumner, Tamara

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Colorado Boulder, Boulder, CO, USA
  • fYear
    2014
  • fDate
    8-12 Sept. 2014
  • Firstpage
    379
  • Lastpage
    388
  • Abstract
    Automatically identifying and extracting key ideas and concepts from educational resources is an important but challenging computational task. We present a supervised machine learning approach to assessing the “coreness” of concepts expressed by resource sentences. The algorithm has been developed and evaluated in the domain of science education where coreness refers to the degree to which a sentence embodies key concepts important to developing a robust understanding of the domain. Our method operates by automatically computing and leveraging the degree of semantic similarity between resource sentences and standard domain concepts designed by human experts for various STEM domains. In our experiments, the algorithm demonstrates high accuracy in identifying sentence coreness when there is agreement between human experts on the coreness rating. We also present performance comparisons with a number of baseline systems.
  • Keywords
    educational computing; learning (artificial intelligence); text analysis; STEM domains; baseline systems; computational task; core concepts; educational resources; resource sentences; science education; semantic similarity; sentence coreness; supervised machine learning; Computational modeling; Feature extraction; Libraries; Semantics; Standards; Training; Core concepts; Semantic similarity; Text summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1109/JCDL.2014.6970194
  • Filename
    6970194