Title :
A parallel parsing algorithm for natural language using tree adjoining grammar
Author :
Nurkkala, Tom ; Kumar, Vipin
Author_Institution :
Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
Abstract :
Tree Adjoining Grammar (TAG) is a powerful grammatical formalism for large-scale natural language processing. However, the computational complexity of parsing algorithms for TAG is high. We introduce a new parallel TAG parsing algorithm for MIMD hypercube multicomputers, using large-granularity grammar partitioning, asynchronous communication, and distributed termination detection. We describe our implementation on the nCUBE/2 parallel computer, and provide experimental results on both random and English grammars. Our algorithm delivers the best performance of any TAG parsing algorithm to date, yielding a speedup of almost two orders of magnitude and good efficiency on up to 256 processors. TAG parsing is a highly unstructured problem. Based on our experience developing a parallel TAG parser, we draw some general conclusions for solving other unstructured problems.
Keywords :
computational complexity; grammars; natural languages; parallel algorithms; parallel processing; MIMD hypercube multicomputers; asynchronous communication; distributed termination detection; grammatical formalism; large-granularity grammar partitioning; nCUBE/2 parallel computer; natural language; parallel parsing algorithm; tree adjoining grammar; Algorithm design and analysis; Computer science; Concurrent computing; High performance computing; Hypercubes; Large-scale systems; Natural language processing; Partitioning algorithms; Technical Activities Guide -TAG
Conference_Title :
Parallel Processing Symposium, 1994. Proceedings., Eighth International
Conference_Location :
Cancun
Print_ISBN :
0-8186-5602-6
DOI :
10.1109/IPPS.1994.288210