Title :
Probabilistic Context-Free Grammars Estimated from Infinite Distributions
Author :
Corazza, A. ; Satta, G.
Author_Institution :
Univ. of Naples, Napoli
Abstract :
In this paper, we consider probabilistic context-free grammars, a class of generative devices that has been successfully exploited in several applications of syntactic pattern matching, especially in statistical natural language parsing. We investigate the problem of training probabilistic context-free grammars on the basis of distributions defined over an infinite set of trees or an infinite set of sentences by minimizing the cross-entropy. This problem has applications in cases of context-free approximation of distributions generated by more expressive statistical models. We show several interesting theoretical properties of probabilistic context-free grammars that are estimated in this way, including the previously unknown equivalence between the grammar cross-entropy with the input distribution and the so-called derivational entropy of the grammar itself. We discuss important consequences of these results involving the standard application of the maximum-likelihood estimator on finite tree and sentence samples, as well as other finite-state models such as hidden Markov models and probabilistic finite automata.
Keywords :
computational linguistics; context-free grammars; iterative methods; learning (artificial intelligence); maximum likelihood estimation; statistical distributions; trees (mathematics); PCFG training; cross-entropy minimization; finite tree; finite-state models; hidden Markov models; infinite distributions; iterative estimation algorithm; maximum-likelihood estimator; probabilistic context-free grammars; probabilistic finite automata; sentence samples; statistical natural language parsing; syntactic pattern matching; Biological system modeling; Character recognition; Computational biology; Context modeling; Entropy; Hidden Markov models; Maximum likelihood estimation; Natural language processing; Natural languages; Pattern matching; Hidden Markov Models.; Probabilistic context-free grammars; cross-entropy; derivational entropy; expectation-maximization methods; maximum-likelihood estimation;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2007.1065