DocumentCode :
2811496
Title :
Document Classification with ACM Subject Hierarchy
Author :
Wang, Tao ; Desai, Bipin C.
Author_Institution :
Concordia Univ., Montreal
fYear :
2007
fDate :
22-26 April 2007
Firstpage :
792
Lastpage :
795
Abstract :
Text categorization or text classification (TC) has recently received increased research attention from information retrieval and machine learning communities, this focus is driven mostly by the ever growing demand for effective and efficient content-based, document management. In the context of digital library or Web portal application, the problem of text categorization is normally that of classification scheme with a topic hierarchy containing all the pre-defined categories. This paper describes our approach to building the hierarchical text classifier for the experimental CINDI Digital Library . The classification system constructed features a top-to-down, coarse-to-fine categorization procedure. We evaluate our system´s performance by experiment on a self-generated corpus of the Computer Science papers archived in ACM DL.
Keywords :
document handling; information retrieval; learning (artificial intelligence); Web portal; coarse-to-fine categorization procedure; content-based, document management; document classification; information retrieval; machine learning; text categorization; text classification; Buildings; Computer science; Content based retrieval; Content management; Information retrieval; Machine learning; Portals; Software libraries; System performance; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering, 2007. CCECE 2007. Canadian Conference on
Conference_Location :
Vancouver, BC
ISSN :
0840-7789
Print_ISBN :
1-4244-1020-7
Electronic_ISBN :
0840-7789
Type :
conf
DOI :
10.1109/CCECE.2007.203
Filename :
4232862
Link To Document :
بازگشت