Title :
Thesaurus Based Term Ranking for Keyword Extraction
Author :
Gazendam, Luit ; Wartena, Christian ; Brussee, Rogier
Author_Institution :
Novay, Enschede, Netherlands
fDate :
Aug. 30 2010-Sept. 3 2010
Abstract :
In many cases keywords from a restricted set of possible keywords have to be assigned to texts. A common way to find the best keywords is to rank terms occurring in the text according to their tf.idf value. This requires a corpus of texts from which document frequencies can be derived. In this paper we show that we can obtain results of the same quality without the usage of a background corpus, using relations between terms provided in a thesaurus.
Keywords :
thesauri; background corpus; document frequencies; keyword extraction; thesaurus based term ranking; Coherence; History; Libraries; Semantics; Thesauri; Vocabulary; extraction; keywords; term ranking; thesaurus;
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2010 Workshop on
Conference_Location :
Bilbao
Print_ISBN :
978-1-4244-8049-4
DOI :
10.1109/DEXA.2010.31