DocumentCode
683033
Title
Knowledge management and Cultural Heritage repositories: Cross-Lingual Information Retrieval strategies
Author
di Buono, M.P. ; Monteleone, M. ; Marano, Francesco ; Monti, Johanna
Author_Institution
Univ. of Salerno, Fisciano, Italy
Volume
2
fYear
2013
fDate
Oct. 28 2013-Nov. 1 2013
Firstpage
295
Lastpage
302
Abstract
In the last years important initiatives, like the development of the European Library and Europeana, aim to increase the availability of cultural content from various types of providers and institutions. The accessibility to these resources requires the development of environments which allow both to manage multilingual complexity and to preserve the semantic interoperability. The creation of Natural Language Processing (NLP) applications is finalized to the achievement of CrossLingual Information Retrieval (CLIR). This paper presents an ongoing research on language processing based on the LexiconGrammar (LG) approach with the goal of improving knowledge management in the Cultural Heritage repositories. The proposed framework aims to guarantee interoperability between multilingual systems in order to overcome crucial issues like cross-language and cross-collection retrieval. Indeed, the LG methodology tries to overcome the shortcomings of statistical approaches as in Google Translate or Bing by Microsoft concerning Multi-Word Unit (MWU) processing in queries, where the lack of linguistic context represents a serious obstacle to disambiguation. In particular, translations concerning specific domains, as it is has been widely recognized, is unambiguous since the meanings of terms are mono-referential and the type of relation that links a given term to its equivalent in a foreign language is biunivocal, i.e. a one-to-one coupling which causes this relation to be exclusive and reversible. Ontologies are used in CLIR and are considered by several scholars a promising research area to improve the effectiveness of Information Extraction (IE) techniques particularly for technical-domain queries. Therefore, we present a methodological framework which allows to map both the data and the metadata among the language-specific onto
Keywords
history; knowledge management; meta data; natural language processing; ontologies (artificial intelligence); open systems; Bing; CLIR; English-Italian language pair; European Library; Europeana; Google Translate; IE techniques; LG approach; MWU processing; Microsoft; NLP applications; cross-lingual information retrieval strategies; cultural content; cultural heritage repositories; foreign language; information extraction techniques; institutions; knowledge management; language-specific ontologies; lexicon-grammar approach; metadata; methodological framework; multilingual systems; multiword unit processing; natural language processing applications; one-to-one coupling; providers; semantic interoperability; semantic search; statistical approaches; technical-domain queries; Compounds; Dictionaries; Grammar; Ontologies; Pragmatics; Resource description framework; Semantics; Cross-Lingual Information Retrieval; Knowledge Management; Ontologies;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Heritage International Congress (DigitalHeritage), 2013
Conference_Location
Marseille
Print_ISBN
978-1-4799-3168-2
Type
conf
DOI
10.1109/DigitalHeritage.2013.6744767
Filename
6744767
Link To Document