• Title of article

    Disambiguating Authors by Pairwise Classification

  • Author/Authors

    LIN, Quan Huazhong University of Science and Technology - Department of Computer Science, China , WANG, Bo Nanjing University of Aeronautics and Astronautics - Department of Computer Science, China , DU, Yuan Tsinghua University - Department of Computer Science, China , WANG, Xuezhi Tsinghua University - Department of Computer Science, China , LI, Yuhua , CHEN, Songcan Nanjing University of Aeronautics and Astronautics - Department of Computer Science, China

  • From page
    668
  • To page
    677
  • Abstract
    Name ambiguity is a critical problem in many applications, in particular in online bibliography systems,such as DBLP, ACM, and CiteSeerx. Despite the many studies, this problem is still not resolved and is becoming even more serious, especially with the increasing popularity of Web 2.0. This paper addresses the problem in the academic researcher social network ArnetMiner using a supervised method for exploiting all side information including co-author, organization, paper citation, title similarity, author’s homepage, web constraint, and user feedback. The method automatically determines the person number k. Tests on the researcher social network with up to 100 different names show that the method significantly outperforms the baseline method using an unsupervised attribute-augmented graph clustering algorithm.
  • Keywords
    disambiguating , pairwise classification , arnetminer
  • Journal title
    Tsinghua Science and Technology
  • Journal title
    Tsinghua Science and Technology
  • Record number

    2535330