Title :
Evaluation of Features for Author Name Disambiguation Using Linear Support Vector Machines
Author :
Piotr Jan Dendek;Lukasz Bolikowski;Michal Lukasik
Author_Institution :
Interdiscipl. Centre for Math. &
fDate :
3/1/2012 12:00:00 AM
Abstract :
Author name disambiguation allows to distinguish between two or more authors sharing the same name. In a previous paper, we have proposed a name disambiguation framework in which for each author name in each article we build a context consisting of classification codes, bibliographic references, co-authors, etc. Then, by pair wise comparison of contexts, we have been grouping contributions likely referring to the same people. In this paper we examine which elements of the context are most effective in author name disambiguation. We employ linear Support Vector Machines (SVM) to find the most influential features.
Keywords :
"Support vector machines","Electronic mail","Null value","Libraries","Vectors","Context","Conferences"
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Print_ISBN :
978-1-4673-0868-7
DOI :
10.1109/DAS.2012.36