DocumentCode :
1957135
Title :
A language model for parsing very long Chinese sentences
Author :
Chen, Hsin-Hsi
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
1993
fDate :
8-11 Nov 1993
Firstpage :
290
Lastpage :
297
Abstract :
By corpus analyses, about 75% of Chinese sentences are composed of more than two sentence segments separated by commas or semicolons. A segment may be a sentence, a noun phrase, a verb phrase, an adjective phrase, an adverbial phrase, or a prepositional phrase. An NP segment may serve as a subject of the next segment or an object of the previous segment. The empty category pro may also appear in the VP segment. The maximal freedom of the uses of pros, the large number of segments, the various segment types, and the associativity problem make sentence parsing difficult. Few parsing systems deal with these problems. The authors regard a segment as a basic parsing unit. It also uses characteristic words, subcategories of verbs, topic chains and some heuristic rules to link the segments into meaningful units. The pro resolution and segment linking are useful for practical applications
Keywords :
computational linguistics; natural languages; NP segment; VP segment; associativity; language model; long Chinese sentences; sentence parsing; sentence segments; Computer science; Couplings; Information analysis; Joining processes; Natural language processing; Natural languages; Particle separators;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 1993. TAI '93. Proceedings., Fifth International Conference on
Conference_Location :
Boston, MA
ISSN :
1063-6730
Print_ISBN :
0-8186-4200-9
Type :
conf
DOI :
10.1109/TAI.1993.633970
Filename :
633970
Link To Document :
بازگشت