• DocumentCode
    1198067
  • Title

    iOLAP: A Framework for Analyzing the Internet, Social Networks, and Other Networked Data

  • Author

    Chi, Yun ; Zhu, Shenghuo ; Hino, Koji ; Gong, Yihong ; Zhang, Yi

  • Author_Institution
    NEC Labs. America, Cupertino, CA
  • Volume
    11
  • Issue
    3
  • fYear
    2009
  • fDate
    4/1/2009 12:00:00 AM
  • Firstpage
    372
  • Lastpage
    382
  • Abstract
    As the amount of noisy, unorganized, linked data on the Internet increases dramatically, how to efficiently analyze such data becomes a challenging research problem. In this paper, we propose a framework, iOLAP, that offers functionalities for analyzing networked data from Internet, social networks, scientific paper citations, etc. We first identify four main data dimensions that are common in most of networked data, namely people, relation, content, and time. Motivated by the fact that various dimensions of data jointly affect each other, we propose a polyadic factorization approach to directly model all the dimensions simultaneously in a unified framework. We provide detailed theoretical analysis of the new modeling framework. In addition to the theoretical framework, we also present an efficient implementation of the algorithm that takes advantage of the sparseness of data and has time complexity linear in the number of data records in a dataset. We then apply the proposed models to analyzing the blogosphere and personalizing recommendation in paper citations. Extensive experimental studies showed that our framework is able to provide deep insights jointed obtained from various dimensions of networked data.
  • Keywords
    Internet; citation analysis; data mining; information filtering; social networking (online); Internet; blogosphere analysis; iOLAP; polyadic factorization approach; recommendation personalization; scientific paper citation analysis; social network; time complexity; Information filtering; knowledge management applications; modeling structured; personalization; textual and multimedia data; web mining;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2009.2012912
  • Filename
    4802391