مرکز منطقه ای اطلاع رساني علوم و فناوري - Natural language grammatical inference with recurrent neural networks

DocumentCode :

1336169

Title :

Natural language grammatical inference with recurrent neural networks

Author :

Lawrence, Steve ; Giles, C. Lee ; Fong, Sandiway

Author_Institution :

NEC Res. Inst., Princeton, NJ, USA

Volume :

Issue :

fYear :

2000

Firstpage :

126

Lastpage :

140

Abstract :

This paper examines the inductive inference of a complex grammar with neural networks and specifically, the task considered is that of training a network to classify natural language sentences as grammatical or ungrammatical, thereby exhibiting the same kind of discriminatory power provided by the Principles and Parameters linguistic framework, or Government-and-Binding theory. Neural networks are trained, without the division into learned vs. innate components assumed by Chomsky (1956), in an attempt to produce the same judgments as native speakers on sharply grammatical/ungrammatical data. How a recurrent neural network could possess linguistic capability and the properties of various common recurrent neural network architectures are discussed. The problem exhibits training behavior which is often not present with smaller grammars and training was initially difficult. However, after implementing several techniques aimed at improving the convergence of the gradient descent backpropagation-through-time training algorithm, significant learning was possible. It was found that certain architectures are better able to learn an appropriate grammar. The operation of the networks and their training is analyzed. Finally, the extraction of rules in the form of deterministic finite state automata is investigated

Keywords :

backpropagation; deterministic automata; finite state machines; grammars; inference mechanisms; linguistics; natural languages; recurrent neural nets; Government-and-Binding theory; Principles and Parameters framework; convergence; deterministic finite state automata; gradient descent backpropagation-through-time; inductive inference; linguistics; natural language grammatical inference; natural language sentence classification; neural net training; recurrent neural networks; Backpropagation algorithms; Computer architecture; Convergence; Data mining; Hidden Markov models; Learning automata; Natural languages; Neural networks; Recurrent neural networks; Simulated annealing;

fLanguage :

English

Journal_Title :

Knowledge and Data Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1041-4347

Type :

jour

DOI :

10.1109/69.842255

Filename :

842255

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1336169