Title :
Document Classification with ACM Subject Hierarchy
Author :
Wang, Tao ; Desai, Bipin C.
Author_Institution :
Concordia Univ., Montreal
Abstract :
Text categorization or text classification (TC) has recently received increased research attention from information retrieval and machine learning communities, this focus is driven mostly by the ever growing demand for effective and efficient content-based, document management. In the context of digital library or Web portal application, the problem of text categorization is normally that of classification scheme with a topic hierarchy containing all the pre-defined categories. This paper describes our approach to building the hierarchical text classifier for the experimental CINDI Digital Library . The classification system constructed features a top-to-down, coarse-to-fine categorization procedure. We evaluate our system´s performance by experiment on a self-generated corpus of the Computer Science papers archived in ACM DL.
Keywords :
document handling; information retrieval; learning (artificial intelligence); Web portal; coarse-to-fine categorization procedure; content-based, document management; document classification; information retrieval; machine learning; text categorization; text classification; Buildings; Computer science; Content based retrieval; Content management; Information retrieval; Machine learning; Portals; Software libraries; System performance; Text categorization;
Conference_Titel :
Electrical and Computer Engineering, 2007. CCECE 2007. Canadian Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
1-4244-1020-7
Electronic_ISBN :
0840-7789
DOI :
10.1109/CCECE.2007.203