• DocumentCode
    3606573
  • Title
    Directed Information Graphs
  • Author
    Quinn, Christopher J. ; Kiyavash, Negar ; Coleman, Todd P.
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • Volume
    61
  • Issue
    12
  • fYear
    2015
  • Firstpage
    6887
  • Lastpage
    6909
  • Abstract
    We propose a graphical model for representing networks of stochastic processes, the minimal generative model graph. It is based on reduced factorizations of the joint distribution over time. We show that under appropriate conditions, it is unique and consistent with another type of graphical model, the directed information graph, which is based on a generalization of Granger causality. We demonstrate how directed information quantifies Granger causality in a particular sequential prediction setting. We also develop efficient methods to estimate the topological structure from data that obviate estimating the joint statistics. One algorithm assumes upper bounds on the degrees and uses the minimal dimension statistics necessary. In the event that the upper bounds are not valid, the resulting graph is nonetheless an optimal approximation in terms of Kullback-Leibler (KL) divergence. Another algorithm uses near-minimal dimension statistics when no bounds are known, but the distribution satisfies a certain criterion. Analogous to how structure learning algorithms for undirected graphical models use mutual information estimates, these algorithms use directed information estimates. We characterize the sample-complexity of two plug-in directed information estimators and obtain confidence intervals. For the setting when point estimates are unreliable, we propose an algorithm that uses confidence intervals to identify the best approximation that is robust to estimation error. Last, we demonstrate the effectiveness of the proposed algorithms through the analysis of both synthetic data and real data from the Twitter network. In the latter case, we identify which news sources influence users in the network by merely analyzing tweet times.
  • Keywords
    directed graphs; Granger causality generalization; KL divergence; Kullback-Leibler divergence; confidence intervals; directed information graphs; estimation error; minimal generative model graph; mutual information estimation; near-minimal dimension statistics; plug-in directed information estimators; real data; sequential prediction setting; stochastic processes; structure learning algorithms; synthetic data; topological structure; Twitter network; undirected graphical models; Approximation algorithms; Approximation methods; Graphical models; Joints; Mutual information; Social network services; Topology; Causality; directed information; generative models; graphical models; network inference
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type
    jour
  • DOI
    10.1109/TIT.2015.2478440
  • Filename
    7273888