• DocumentCode
    991213
  • Title

    Ambiguity resolution analysis in incremental parsing of natural language

  • Author

    Costa, Fabrizio ; Frasconi, Paolo ; Lombardo, Vincenzo ; Sturt, Patrick ; Soda, Giovanni

  • Author_Institution
    Dipt. di Sistemi e Informatica, Univ. di Firenze, Italy
  • Volume
    16
  • Issue
    4
  • fYear
    2005
  • fDate
    7/1/2005 12:00:00 AM
  • Firstpage
    959
  • Lastpage
    971
  • Abstract
    Incremental parsing gains its importance in natural language processing and psycholinguistics because of its cognitive plausibility. Modeling the associated cognitive data structures, and their dynamics, can lead to a better understanding of the human parser. In earlier work, we have introduced a recursive neural network (RNN) capable of performing syntactic ambiguity resolution in incremental parsing. In this paper, we report a systematic analysis of the behavior of the network that allows us to gain important insights about the kind of information that is exploited to resolve different forms of ambiguity. In attachment ambiguities, in which a new phrase can be attached at more than one point in the syntactic left context, we found that learning from examples allows us to predict the location of the attachment point with high accuracy, while the discrimination amongst alternative syntactic structures with the same attachment point is slightly better than making a decision purely based on frequencies. We also introduce several new ideas to enhance the architectural design, obtaining significant improvements of prediction accuracy, up to 25% error reduction on the same dataset used in previous work. Finally, we report large scale experiments on the entire Wall Street Journal section of the Penn Treebank. The best prediction accuracy of the model on this large dataset is 87.6%, a relative error reduction larger than 50% compared to previous results.
  • Keywords
    cognitive systems; data structures; grammars; incremental compilers; linguistics; natural languages; neural nets; very large databases; Penn Treebank; Wall Street Journal; ambiguity resolution analysis; associated cognitive data structures; cognitive plausibility; incremental parsing; large dataset; natural language processing; prediction accuracy; psycholinguistics; recursive neural network; Accuracy; Data structures; Frequency; Humans; Information analysis; Natural language processing; Natural languages; Neural networks; Psychology; Recurrent neural networks; First-pass attachment; incremental parsing; learning preferences; recursive neural networks (RNNs); structured data; Algorithms; Linguistics; Natural Language Processing; Neural Networks (Computer); Pattern Recognition, Automated; Vocabulary, Controlled;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2005.849837
  • Filename
    1461437