• DocumentCode
    1735396
  • Title

    A Classifier Ensembling Approach for Imbalanced Social Link Prediction

  • Author

    Hurtado, Juan ; Taweewitchakreeya, Napat ; Xue Kong ; Xingquan Zhu

  • Author_Institution
    Dept. of Comput. & Electr. Eng. & Comput. Sci., Florida Atlantic Univ., Boca Raton, FL, USA
  • Volume
    1
  • fYear
    2013
  • Firstpage
    436
  • Lastpage
    439
  • Abstract
    Supervised learning is a commonly used tool for link prediction in social networks, where data imbalance is a major challenge because only a small portion of nodes may have social connections. In this paper, we propose to use a k-nearest neighbor sampling and a random sampling combined approach to address data imbalance issue for social link prediction. In our solution, we use two sampling approaches to generate multiple copies of the network data, and then use a number of similarity link prediction measures to generate independent variable (features) as training data. For each copy of the sampled data, we train a classifier and use a classifier ensemble for final link prediction. Experimental results on a co-authorship network validate the performance of the proposed design.
  • Keywords
    learning (artificial intelligence); pattern classification; sampling methods; social networking (online); classifier ensembling approach; coauthorship network; data imbalance; final link prediction; imbalanced social link prediction; random sampling combined approach; sampling approaches; social connections; social networks; supervised learning; Accuracy; Couplings; Data models; Decision trees; Logistics; Prediction algorithms; Training data; classification; co-authorship network; data imbalance; link prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2013 12th International Conference on
  • Conference_Location
    Miami, FL
  • Type

    conf

  • DOI
    10.1109/ICMLA.2013.88
  • Filename
    6784658