Title :
An Algorithm to Tackle the Name Authority Control Problem Using Semantic Information
Author :
Chavez-Aragon, A. ; Cruz, José Federico Ramirez ; Reyes-Galaviz, Orion F. ; Ayanegui-Santiago, Huberto ; Portilla, Alberto
Author_Institution :
Fac. de Cienc. Basicas, Ing. y Tecnol., Univ. Autonoma de Tlaxcala, Tlaxcala, Mexico
Abstract :
Name disambiguation is a focal point on realworld information integration, analysis, and data mining. This problem, also known as the classical name authority control problem, consists in "same authors with different spellings" or "different authors with the same spelling". The problem is augmented in large data repositories where information changes and grows over time (e.g., DBLP, CiteSeer). In particular, we are mainly interested in DBLP because we use this database to discover the publishing movement among Mexican researchers. In this paper, we propose an algorithm that solves the name authority control problem. Our approach aims to improve the identity author tracking by using semantic information about authors, even thought they use different name varieties to sign their research work over time.
Keywords :
bibliographic systems; data mining; database management systems; DBLP; digital bibliography-library project; identity author tracking; name authority control; name disambiguation; semantic information; Bibliographies; Computer science; Data analysis; Data mining; Databases; Informatics; Information analysis; Information retrieval; Laboratories; Publishing; Name disambiguation; identity uncertainty; name authority control problem; semantic information;
Conference_Titel :
Computer Science (ENC), 2009 Mexican International Conference on
Conference_Location :
Mexico City
Print_ISBN :
978-1-4244-5258-3
DOI :
10.1109/ENC.2009.38