Title :
Linking the Thesaurus for the Social Sciences to the Web of Linked Data
Author :
Alam, Andias Wira ; Kempf, Andreas Oskar ; Zapilko, Benjamin
Author_Institution :
GESIS - Leibniz Inst. for the Social Sci., Cologne, Germany
Abstract :
In this paper, we apply different methods for linking subject headings of the Thesaurus for the Social Sciences (TheSoz) to DBpedia, the nucleus of the Web of Linked Data, which is derived from the structured information of Wikipedia. Our method uses the backlinks and outlinks within Wikipedia for link detection. We examine to what extent a network-based similarity measure can optimize the linking process and thereby achieve higher precision and recall. We test two baseline methods, string alignment and language property matching, and compare them to our own method. Our method exceeds the F-scores of the baselines by 10 percentage points.
Keywords :
Internet; Web sites; social sciences computing; string matching; thesauri; DBpedia; F-scores; Web of Linked Data; Wikipedia; backlinks; language property matching; link detection; linking process; network-based similarity measure; outlinks; string alignment; structured information; thesaurus for the social sciences; Electronic publishing; Encyclopedias; Gold; Joining processes; information retrieval; social sciences
Conference_Title :
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location :
London
DOI :
10.1109/JCDL.2014.6970223