• DocumentCode
    2362471
  • Title

    Text classification and keyword extraction by learning decision trees

  • Author

    Sakakibara, Yasubumi ; Misue, Kazuo ; Koshiba, Takeshi

  • Author_Institution
    Fujitsu Lab., Ltd., Numazu, Shizuoka, Japan
  • fYear
    1993
  • fDate
    1-5 Mar 1993
  • Firstpage
    466
  • Abstract
    Summary form only given. The authors propose a completely new approach to the problem of text classification and automatic keyword extraction by using machine learning techniques. They introduce a class of representations for classifying text data based on decision trees, and present an algorithm for learning it inductively. The algorithm does not need any natural language processing technique, and is robust to noisy data. It is shown that the learning algorithm can be used for automatic extraction of keywords for text retrieval and automatic text categorization. Some experimental results on the use of the algorithm are reported
  • Keywords
    classification; learning (artificial intelligence); linguistics; natural languages; automatic keyword extraction; automatic text categorization; decision trees; learning; machine learning; natural language processing; noisy data; text classification; text retrieval; Binary trees; Books; Classification tree analysis; Data mining; Decision trees; Entropy; Laboratories; Libraries; Noise robustness; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence for Applications, 1993. Proceedings., Ninth Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    0-8186-3840-0
  • Type

    conf

  • DOI
    10.1109/CAIA.1993.366617
  • Filename
    366617