DocumentCode :
2119879
Title :
Cloud-Based Name Disambiguation Algorithm
Author :
Juan, Yang ; Hua, He ; Bin, Wu
Author_Institution :
Beijing Key Lab. of Intell. Telecommun. Software & Multimedia, Beijing Univ. of Posts & Telecommun., Beijing, China
Volume :
2
fYear :
2010
fDate :
7-8 Aug. 2010
Firstpage :
155
Lastpage :
158
Abstract :
In scientific collaboration networks, a single author name commonly corresponds to many distinct author entities. Traditional name disambiguation algorithms perform inefficiently on massive data. This paper presents a parallel algorithm for the name disambiguation problem: it first merges authors with the same name and similar author information, then divides the scientific collaboration network into author communities; authors with the same name within one community are assumed, with high probability, to be one entity. The algorithm is built on a cloud computing platform and can handle massive data. In our experiments, the algorithm processed massive data efficiently and achieved an average F-score of 0.93.
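The two stages named in the abstract can be illustrated with a minimal single-machine sketch. This is not the paper's implementation: the record does not specify the similarity measure, the community detection method, or the parallelization, so the sketch assumes Jaccard similarity over a hypothetical set of "info" tokens per author mention, an arbitrary threshold of 0.5, and connected components of the co-authorship graph as a simple stand-in for community detection. All names (`disambiguate`, `mentions`, `info`, `paper`) are illustrative.

```python
from collections import defaultdict
from itertools import combinations


class UnionFind:
    """Disjoint-set structure; the smallest index becomes the root."""

    def __init__(self, n):
        self.parent = list(range(n))

    def find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[max(ra, rb)] = min(ra, rb)


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def disambiguate(mentions, threshold=0.5):
    """Return one entity label per author mention.

    Each mention is a dict with keys "name", "info" (set of tokens)
    and "paper" (paper id). The threshold is an assumed value.
    """
    n = len(mentions)

    # Stage 1: merge mentions with the same name and similar information.
    merged = UnionFind(n)
    by_name = defaultdict(list)
    for i, m in enumerate(mentions):
        by_name[m["name"]].append(i)
    for ids in by_name.values():
        for i, j in combinations(ids, 2):
            if jaccard(mentions[i]["info"], mentions[j]["info"]) >= threshold:
                merged.union(i, j)

    # Stage 2: group mentions into communities. Connected components of the
    # co-authorship graph stand in for a real community detection algorithm.
    community = UnionFind(n)
    for i in range(n):
        community.union(i, merged.find(i))  # carry over stage-1 merges
    by_paper = defaultdict(list)
    for i, m in enumerate(mentions):
        by_paper[m["paper"]].append(i)
    for ids in by_paper.values():
        for i in ids[1:]:
            community.union(ids[0], i)  # co-authors share a community

    # Same name inside the same community is treated as one entity.
    return [(community.find(i), mentions[i]["name"]) for i in range(n)]


mentions = [
    {"name": "J. Smith", "info": {"mit"}, "paper": 0},
    {"name": "A. Lee", "info": {"mit"}, "paper": 0},
    {"name": "J. Smith", "info": {"mit"}, "paper": 1},
    {"name": "B. Chen", "info": {"mit"}, "paper": 1},
    {"name": "J. Smith", "info": set(), "paper": 2},
    {"name": "A. Lee", "info": {"mit"}, "paper": 2},
    {"name": "J. Smith", "info": {"oxford"}, "paper": 3},
    {"name": "C. Diaz", "info": {"oxford"}, "paper": 3},
]
labels = disambiguate(mentions)
```

In this toy input, the "J. Smith" mention with no author information (index 4) cannot be merged in the first stage, but it shares a co-author with the other MIT mentions and so lands in their community; the Oxford "J. Smith" stays in a separate community and remains a distinct entity.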
Keywords :
data handling; groupware; parallel algorithms; cloud-based name disambiguation algorithm; cloud-computing platform; massive data; parallel algorithm; scientific collaboration networks; Algorithm design and analysis; Cloud computing; Clustering algorithms; Collaboration; Communities; Software; Software algorithms; Cloud Computing; Community Detection; Name Disambiguation; Similarity;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Science and Management Engineering (ISME), 2010 International Conference of
Conference_Location :
Xi'an
Print_ISBN :
978-1-4244-7669-5
Electronic_ISBN :
978-1-4244-7670-1
Type :
conf
DOI :
10.1109/ISME.2010.33
Filename :
5573917