DocumentCode :
2418993
Title :
Data collection and normalization for building the Scenario-Based Lexical Knowledge Resource of a text-to-scene conversion system
Author :
Rouhizadeh, Masoud ; Bowler, Margit ; Sproat, Richard ; Coyne, Bob
Author_Institution :
Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA
fYear :
2010
fDate :
9-10 Dec. 2010
Firstpage :
25
Lastpage :
30
Abstract :
WordsEye is a system for converting from English text into three-dimensional graphical scenes that represent that text. It works by performing syntactic and semantic analyses on the input text, producing a description of the arrangement of objects in a scene. At the core of WordsEye is the Scenario-Based Lexical Knowledge Resource (SBLR), a unified knowledge base and representational system for expressing lexical and real-world knowledge needed to depict scenes from text. This paper explores information collection methods for building the SBLR, using Amazon's Mechanical Turk (AMT) and manual normalization of raw AMT data. The paper follows with manual review of existing relations in the SBLR and classification of the AMT data into existing and new semantic relations. Since manual annotation is a time-consuming and expensive approach, we also explored the use of automatic normalization of AMT data through log-odds and log-likelihood ratios extracted from the English Gigaword corpus, as well as through WordNet similarity measures.
Keywords :
knowledge representation; multimedia computing; natural language interfaces; natural language processing; natural scenes; text analysis; Amazon mechanical turk; WordsEye; data collection; data normalization; scenario based lexical knowledge resource; text to scene conversion system; three dimensional graphical scene; Animation; Data mining; Libraries; Manuals; Natural languages; Semantics; Three dimensional displays;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Semantic Media Adaptation and Personalization (SMAP), 2010 5th International Workshop on
Conference_Location :
Limassol
Print_ISBN :
978-1-4244-8603-8
Electronic_ISBN :
978-1-4244-8601-4
Type :
conf
DOI :
10.1109/SMAP.2010.5706851
Filename :
5706851