Title of article :
Predicting Library of Congress Classifications From
Library of Congress Subject Headings
Author/Authors :
Eibe Frank، نويسنده , , Gordon W. Paynter ، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2004
Abstract :
This paper addresses the problem of automatically assigning
a Library of Congress Classification (LCC) to a
work given its set of Library of Congress Subject Headings
(LCSH). LCCs are organized in a tree: The root node
of this hierarchy comprises all possible topics, and leaf
nodes correspond to the most specialized topic areas
defined. We describe a procedure that, given a resource
identified by its LCSH, automatically places that resource
in the LCC hierarchy. The procedure uses machine
learning techniques and training data from a large
library catalog to learn a model that maps from sets of
LCSH to classifications from the LCC tree. We present
empirical results for our technique showing its accuracy
on an independent collection of 50,000 LCSH/LCC pairs
Journal title :
Journal of the American Society for Information Science and Technology
Journal title :
Journal of the American Society for Information Science and Technology