Title :
A model of the MT lexicon for verbs as an interface between syntactic parsing and semantic representations
Author :
Alam, Yukiko Sasaki
Author_Institution :
Dept. of Digital Media, Hosei Univ., Koganei, Japan
Abstract :
Machine translation (MT) deals with large amounts of data representing sentence meanings that are constructed from the grammatical rules and lexical knowledge of the source and target languages. Because automated semantic analysis of source language data is hardly possible, the predominant methods in use have been transfer models, such as phrase-based statistical MT and pattern-based MT, whose main task is to replace source language text with target language text without understanding either. Since nearly half of text data reportedly consists of unanalyzable multiword expressions such as collocations, idioms, and phrasal verbs, resorting to such methods would be inevitable. At the same time, however, MT output quality has been observed to suffer significantly when sentences are not divided into proper segments. This paper proposes a model of the lexicon of verbs whose rich relational information contributes to parsing and understanding sentences. Whereas existing lexicon models give little or no consideration to how the lexicon interacts with syntactic parsing and semantic representation, the proposed model is designed to serve as an interface between the two processes when handling sentences with regular usages of verbs as well as idiomatic and metaphoric usages.
Keywords :
grammars; language translation; statistical analysis; text analysis; MT lexicon model; automated semantic analysis; grammatical rules; idiomatic usages; lexical knowledge; machine translation; metaphoric usages; pattern-based MT; phrase-based statistical MT; relational information; semantic representation; sentence meanings; source language data; source language text; syntactic parsing; target language text; text data; transfer models; Abstracts; Compounds; Context; Educational institutions; Natural language processing; Semantics; Syntactics; Lexicon for a parser; Lexicon of verbs; Machine translation; Metaphoric usages of verbs
Conference_Title :
Information and Automation (ICIA), 2014 IEEE International Conference on
Conference_Location :
Hailar
DOI :
10.1109/ICInfA.2014.6932686