• Title of article

    Top-k similarity search in heterogeneous information networks with x-star network schema

  • Author/Authors

    Zhang، نويسنده , , Mingxi and Hu، نويسنده , , Hao and He، نويسنده , , Zhenying and Wang، نويسنده , , Wei، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2015
  • Pages
    14
  • From page
    699
  • To page
    712
  • Abstract
    An x-star network is an information network which consists of centers with connections among themselves, and different type attributes linking to these centers. As x-star networks become ubiquitous, extracting knowledge from x-star networks has become an important task. Similarity search in x-star network aims to find the centers similar to a given query center, which has numerous applications including collaborative filtering, community mining and web search. Although existing methods yield promising similar results, such as SimRank and P-Rank, they are not applicable for massive x-star networks. In this paper, we propose a structural-based similarity measure, NetSim, towards efficiently computing similarity between centers in an x-star network. The similarity between attributes is computed in the pre-processing stage by the expected meeting probability over attribute network that is extracted from the whole structure of x-star network. The similarity between centers is computed online according to the attribute similarities based on the intuition that similar centers are linked with similar attributes. NetSim requires less time and space cost than existing methods since the scale of attribute network is significantly smaller than the whole x-star network. For supporting fast online query processing, we develop a pruning algorithm by building a pruning index, which prunes candidate centers that are not promising. Extensive experiments demonstrate the effectiveness and efficiency of our method through comparing with the state-of-the-art measures.
  • Keywords
    Similarity search , information network , x-star network schema
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2015
  • Journal title
    Expert Systems with Applications
  • Record number

    2355452