DocumentCode
3547212
Title
Generating a variety of expressions from visual information and user-designated viewpoints
Author
Noguchi, Y. ; Kondo, Makoto ; Kogure, Satoru ; Konishi, Tsuyoshi ; Itoh, Yoshio ; Takagi, A. ; Asoh, Hidek ; Kobayashi, Ichiro
Author_Institution
Shizuoka Univ., Hamamatsu, Japan
fYear
2013
fDate
2-4 Nov. 2013
Firstpage
322
Lastpage
328
Abstract
This paper reports the development and evaluation of a natural language generation system which generates a variety of language expressions from visual information taken by a CCD camera. The feature of this system is to generate a variety of language expressions from combinations of different syntactic structures and different sets of vocabulary, while managing the generation process based on the user-designated viewpoints. The system converts the visual information into a concept dependency structure using a semantic representation framework proposed by Takagi and Itoh. The system then transforms the structure and divides it into a set of words, deriving a word dependency structure, which is later arranged into a sentence. The transformation of a concept dependency structure and the variation in word segmentation allow the system to generate a variety of sentences from the same visual information. In this paper, we employ user-designated viewpoints to scenes containing more than one object. We designed the parameters of the user-designated viewpoints which enable the system to manage the generation process and to generate a variety of expressions. An evaluation has confirmed that the system generates certain variations according to parameter values set by the user. The variations include expressions referring to attribute values of the objects in the scenes and relative expressions denoting the relations between the targeted object and others.
Keywords
CCD image sensors; natural language processing; word processing; CCD camera; concept dependency structure; language expression generation; natural language generation system; semantic representation framework; syntactic structure combination; user-designated viewpoint; visual information; visual scene; vocabulary; word dependency structure; word segmentation; Educational institutions; Image color analysis; Natural languages; Prototypes; Semantics; Standards; Visualization; natural language generation; relative expressions; viewpoints; visual scene;
fLanguage
English
Publisher
ieee
Conference_Titel
Awareness Science and Technology and Ubi-Media Computing (iCAST-UMEDIA), 2013 International Joint Conference on
Conference_Location
Aizuwakamatsu
Type
conf
DOI
10.1109/ICAwST.2013.6765459
Filename
6765459
Link To Document