• DocumentCode
    3425831
  • Title

    Hierarchically classifying documents with multiple labels

  • Author

    Mayne, Andrew ; Perry, Russell

  • Author_Institution
    Oxford Univ., Oxford
  • fYear
    2009
  • fDate
    March 30 2009-April 2 2009
  • Firstpage
    133
  • Lastpage
    139
  • Abstract
    This paper describes the evaluation of a hierarchical classifier for classifying multi-labeled documents organized in a two-level taxonomy. The hierarchical classifier consists of a tree of independent naive Bayes classifiers, with output probabilities from parent classifiers propagated to child classifiers as additional features. Each classifier uses Bi-Normal Feature Separation for word feature selection. Experiments were performed using the Weka Toolkit adapted to deal with multi-labeled documents. The hierarchical classifier accuracy marginally out-performed a set of independent binary classifiers trained to classify documents for each class in the taxonomy.
  • Keywords
    Bayes methods; document handling; feature extraction; pattern classification; probability; trees (mathematics); bi-normal feature separation; hierarchical document classification; independent naive Bayes classifier tree; multi labeled document; output probability; word feature selection; Classification tree analysis; Feature extraction; Feedback; Feeds; Information retrieval; Information technology; Search engines; Taxonomy; User interfaces; Wikipedia;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on
  • Conference_Location
    Nashville, TN
  • Print_ISBN
    978-1-4244-2765-9
  • Type

    conf

  • DOI
    10.1109/CIDM.2009.4938640
  • Filename
    4938640