• DocumentCode
    2811496
  • Title

    Document Classification with ACM Subject Hierarchy

  • Author

    Wang, Tao ; Desai, Bipin C.

  • Author_Institution
    Concordia Univ., Montreal
  • fYear
    2007
  • fDate
    22-26 April 2007
  • Firstpage
    792
  • Lastpage
    795
  • Abstract
    Text categorization or text classification (TC) has recently received increased research attention from information retrieval and machine learning communities, this focus is driven mostly by the ever growing demand for effective and efficient content-based, document management. In the context of digital library or Web portal application, the problem of text categorization is normally that of classification scheme with a topic hierarchy containing all the pre-defined categories. This paper describes our approach to building the hierarchical text classifier for the experimental CINDI Digital Library . The classification system constructed features a top-to-down, coarse-to-fine categorization procedure. We evaluate our system´s performance by experiment on a self-generated corpus of the Computer Science papers archived in ACM DL.
  • Keywords
    document handling; information retrieval; learning (artificial intelligence); Web portal; coarse-to-fine categorization procedure; content-based, document management; document classification; information retrieval; machine learning; text categorization; text classification; Buildings; Computer science; Content based retrieval; Content management; Information retrieval; Machine learning; Portals; Software libraries; System performance; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2007. CCECE 2007. Canadian Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    0840-7789
  • Print_ISBN
    1-4244-1020-7
  • Electronic_ISBN
    0840-7789
  • Type

    conf

  • DOI
    10.1109/CCECE.2007.203
  • Filename
    4232862