Ambiguity resolution analysis in incremental parsing of natural language

Author

Costa, Fabrizio ; Frasconi, Paolo ; Lombardo, Vincenzo ; Sturt, Patrick ; Soda, Giovanni

Author_Institution

Dipt. di Sistemi e Informatica, Univ. di Firenze, Italy

Volume

16

Issue

4

fYear

2005

fDate

7/1/2005 12:00:00 AM

Firstpage

959

Lastpage

971

Abstract

Incremental parsing gains its importance in natural language processing and psycholinguistics because of its cognitive plausibility. Modeling the associated cognitive data structures, and their dynamics, can lead to a better understanding of the human parser. In earlier work, we have introduced a recursive neural network (RNN) capable of performing syntactic ambiguity resolution in incremental parsing. In this paper, we report a systematic analysis of the behavior of the network that allows us to gain important insights about the kind of information that is exploited to resolve different forms of ambiguity. In attachment ambiguities, in which a new phrase can be attached at more than one point in the syntactic left context, we found that learning from examples allows us to predict the location of the attachment point with high accuracy, while the discrimination amongst alternative syntactic structures with the same attachment point is slightly better than making a decision purely based on frequencies. We also introduce several new ideas to enhance the architectural design, obtaining significant improvements of prediction accuracy, up to 25% error reduction on the same dataset used in previous work. Finally, we report large scale experiments on the entire Wall Street Journal section of the Penn Treebank. The best prediction accuracy of the model on this large dataset is 87.6%, a relative error reduction larger than 50% compared to previous results.

Keywords

cognitive systems; data structures; grammars; incremental compilers; linguistics; natural languages; neural nets; very large databases; Penn Treebank; Wall Street Journal; ambiguity resolution analysis; associated cognitive data structures; cognitive plausibility; incremental parsing; large dataset; natural language processing; prediction accuracy; psycholinguistics; recursive neural network; Accuracy; Data structures; Frequency; Humans; Information analysis; Natural language processing; Natural languages; Neural networks; Psychology; Recurrent neural networks; First-pass attachment; incremental parsing; learning preferences; recursive neural networks (RNNs); structured data; Algorithms; Linguistics; Natural Language Processing; Neural Networks (Computer); Pattern Recognition, Automated; Vocabulary, Controlled;

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

ieee

ISSN

1045-9227

Type

jour

DOI

10.1109/TNN.2005.849837

Filename

1461437