• DocumentCode
    3563722
  • Title

    Extraction and categorization of transition information from large volume of texts using patterns and machine learning

  • Author

    Hori, Sanako ; Murata, Masaki ; Tokuhisa, Masato ; Qing Ma

  • Author_Institution
    Dept. of Inf. & Electron., Tottori Univ., Tottori, Japan
  • fYear
    2014
  • Firstpage
    1102
  • Lastpage
    1107
  • Abstract
    The information on transition of a thing is important for acquiring knowledge on the thing. In this study, we extracted transition information from a large number of sentences using pattern-based methods. We obtained an F-measure of 0.86 for extraction of transition information by this method. In order to improve the results, we then combined the pattern-based methods with supervised machine-learning methods. We obtained F-measures of 0.91 and 0.89 for extraction of transition information by support vector machine and maximum entropy methods, respectively. Thus, we confirmed that the combined approach outperformed the pattern-based method. We also categorized transition information. In the experiments, we obtained an F-measure of 0.6 for transition information categorization for categories containing many events in the training data set. Furthermore, we examined sentences in terms of conceptual changes in their transition information. We classified transition information in various ways. The categories by the classification provide a theoretical basis for future studies on transition information.
  • Keywords
    data mining; learning (artificial intelligence); maximum entropy methods; pattern classification; support vector machines; text analysis; F-measure; conceptual changes; knowledge acquisition; large-text volume; machine learning; maximum entropy methods; pattern-based methods; supervised machine-learning methods; support vector machine; training data set; transition information categorization; transition information classification; transition information extraction; Concrete; Data mining; Manuals; Pattern matching; Resource description framework; Robots; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
  • Type

    conf

  • DOI
    10.1109/SCIS-ISIS.2014.7044700
  • Filename
    7044700