• DocumentCode
    3756764
  • Title

    Predicting Churn of Expert Respondents in Social Networks Using Data Mining Techniques: A Case Study of Stack Overflow

  • Author

    Ifeoma Adaji;Julita Vassileva

  • Author_Institution
    Univ. of Saskatchewan, Saskatoon, SK, Canada
  • fYear
    2015
  • Firstpage
    182
  • Lastpage
    189
  • Abstract
    In Q&A social networks, the few respondents that answer most of the questions are an asset to that network. Being able to predict the churn of these expert respondents will enable the owners of such network put things in place in order to keep them. In this paper, we predicted the churn of expert respondents in Stack Overflow. We identified experts based on the InDegree of the respondents and the value of the incentives earned by these experts from the questions they have answered in the past. Using four data mining techniques: logistic regression, neural networks, support vector machines and random forests, we predicted user churn and evaluated our results with four evaluation metrics: percentage correctly classified, area under receiver operating characteristic curve, precision and recall. Of the four data mining algorithms, random forests performed best with PCC of 76%, ROC area of 0.82, precision of 0.76 and recall of 0.77.
  • Keywords
    "Social network services","Data mining","Classification algorithms","Prediction algorithms","Logistics","Measurement","Algorithm design and analysis"
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2015 IEEE 14th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICMLA.2015.120
  • Filename
    7424306