DocumentCode :
3606573
Title :
Directed Information Graphs
Author :
Quinn, Christopher J. ; Kiyavash, Negar ; Coleman, Todd P.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana- Champaign, Urbana, IL, USA
Volume :
61
Issue :
12
fYear :
2015
Firstpage :
6887
Lastpage :
6909
Abstract :
We propose a graphical model for representing networks of stochastic processes, the minimal generative model graph. It is based on reduced factorizations of the joint distribution over time. We show that under appropriate conditions, it is unique and consistent with another type of graphical model, the directed information graph, which is based on a generalization of Granger causality. We demonstrate how directed information quantifies Granger causality in a particular sequential prediction setting. We also develop efficient methods to estimate the topological structure from data that obviate estimating the joint statistics. One algorithm assumes upper bounds on the degrees and uses the minimal dimension statistics necessary. In the event that the upper bounds are not valid, the resulting graph is nonetheless an optimal approximation in terms of Kullback-Leibler (KL) divergence. Another algorithm uses near-minimal dimension statistics when no bounds are known, but the distribution satisfies a certain criterion. Analogous to how structure learning algorithms for undirected graphical models use mutual information estimates, these algorithms use directed information estimates. We characterize the sample-complexity of two plug-in directed information estimators and obtain confidence intervals. For the setting when point estimates are unreliable, we propose an algorithm that uses confidence intervals to identify the best approximation that is robust to estimation error. Last, we demonstrate the effectiveness of the proposed algorithms through the analysis of both synthetic data and real data from the Twitter network. In the latter case, we identify which news sources influence users in the network by merely analyzing tweet times.
Keywords :
directed graphs; Granger causality generalization; KL divergence; Kullback-Leibler divergence; confidence intervals; directed information graphs; estimation error; minimal generative model graph; mutual information estimation; near-minimal dimension statistics; plug-in directed information estimators; real data; sequential prediction setting; stochastic processes; structure learning algorithms; synthetic data; topological structure; twitter network; undirected graphical models; Approximation algorithms; Approximation methods; Graphical models; Joints; Mutual information; Social network services; Topology; Causality; directed information; generative models; graphical models; network inference;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2015.2478440
Filename :
7273888
Link To Document :
بازگشت