• DocumentCode
    2234636
  • Title

    Predicting citation counts of papers

  • Author

    Chen, Junpeng ; Zhang, Chunxia

  • Author_Institution
    School of Software, Beijing Institute of Technology, China
  • fYear
    2015
  • fDate
    6-8 July 2015
  • Firstpage
    434
  • Lastpage
    440
  • Abstract
    The task of citation counts prediction is to predict the citation counts of a paper after a given time period. Future citation counts of papers are an important metric to estimate potential influences of published papers, and will be helpful for researchers to choose representative literatures. This task can be treated as a regression problem. This paper proposes two types of predictive features to represent fundamental characteristics of papers and authors: six content features and ten author features. We introduce the IBM Model 1 to calculate the association probabilities between paper topics which are employed to extract content features, and use the bipartite network projection to obtain the author collaboration network which is utilized to extract author features. Further, we introduce the Gradient Boosted Regression Trees to predict citation counts of papers. Our approach combines contents and topics of papers and multi-dimensional measures of author collaborations in one learning process. Experimental results on the KDD CUP dataset demonstrate that our predicting features and models are effective to solve the problem of citation counts prediction of papers.
  • Keywords
    Biological system modeling; Computational modeling; Computer science; Predictive models; Gradient Boosted Regression Trees; IBM Model 1; bipartite network projection; citation counts prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics & Cognitive Computing (ICCI*CC), 2015 IEEE 14th International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    978-1-4673-7289-3
  • Type

    conf

  • DOI
    10.1109/ICCI-CC.2015.7259421
  • Filename
    7259421