DocumentCode
2811496
Title
Document Classification with ACM Subject Hierarchy
Author
Wang, Tao ; Desai, Bipin C.
Author_Institution
Concordia Univ., Montreal
fYear
2007
fDate
22-26 April 2007
Firstpage
792
Lastpage
795
Abstract
Text categorization or text classification (TC) has recently received increased research attention from information retrieval and machine learning communities, this focus is driven mostly by the ever growing demand for effective and efficient content-based, document management. In the context of digital library or Web portal application, the problem of text categorization is normally that of classification scheme with a topic hierarchy containing all the pre-defined categories. This paper describes our approach to building the hierarchical text classifier for the experimental CINDI Digital Library . The classification system constructed features a top-to-down, coarse-to-fine categorization procedure. We evaluate our system´s performance by experiment on a self-generated corpus of the Computer Science papers archived in ACM DL.
Keywords
document handling; information retrieval; learning (artificial intelligence); Web portal; coarse-to-fine categorization procedure; content-based, document management; document classification; information retrieval; machine learning; text categorization; text classification; Buildings; Computer science; Content based retrieval; Content management; Information retrieval; Machine learning; Portals; Software libraries; System performance; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 2007. CCECE 2007. Canadian Conference on
Conference_Location
Vancouver, BC
ISSN
0840-7789
Print_ISBN
1-4244-1020-7
Electronic_ISBN
0840-7789
Type
conf
DOI
10.1109/CCECE.2007.203
Filename
4232862
Link To Document