• DocumentCode
    1303446
  • Title

    Generalized queries on probabilistic context-free grammars

  • Author

    Pynadath, David V. ; Wellman, Michael P.

  • Author_Institution
    Artificial Intelligence Lab., Michigan Univ., Ann Arbor, MI, USA
  • Volume
    20
  • Issue
    1
  • fYear
    1998
  • fDate
    1/1/1998 12:00:00 AM
  • Firstpage
    65
  • Lastpage
    77
  • Abstract
    Probabilistic context-free grammars (PCFGs) provide a simple way to represent a particular class of distributions over sentences in a context-free language. Efficient parsing algorithms for answering particular queries about a PCFG (i.e., calculating the probability of a given sentence, or finding the most likely parse) have been developed and applied to a variety of pattern-recognition problems. We extend the class of queries that can be answered in several ways: (1) allowing missing tokens in a sentence or sentence fragment, (2) supporting queries about intermediate structure, such as the presence of particular nonterminals, and (3) flexible conditioning on a variety of types of evidence. Our method works by constructing a Bayesian network to represent the distribution of parse trees induced by a given PCFG. The network structure mirrors that of the chart in a standard parser, and is generated using a similar dynamic programming approach. We present an algorithm for constructing Bayesian networks from PCFGs, and show how queries or patterns of queries on the network correspond to interesting queries on PCFGs. The network formalism also supports extensions to encode various context sensitivities within the probabilistic dependency structure
  • Keywords
    computational complexity; context-free grammars; directed graphs; probability; query processing; Bayesian network; context sensitivities; dynamic programming approach; flexible conditioning; generalized queries; missing tokens; nonterminals; parsing algorithms; pattern-recognition problems; probabilistic context-free grammars; probabilistic dependency structure; sentence fragment; Bayesian methods; Computer languages; Context modeling; Mirrors; Natural languages; Probability; RNA; Speech; Stochastic processes; Uncertainty;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.655650
  • Filename
    655650