DocumentCode
168296
Title
Identifying the same records across multiple Ukiyo-e image databases using textual data in different languages
Author
Batjargal, Biligsaikhan ; Kuyama, Takeo ; Kimura, Fumitaka ; Maeda, Atsushi
Author_Institution
Kinugasa Res. Organ., Ritsumeikan Univ., Kyoto, Japan
fYear
2014
fDate
8-12 Sept. 2014
Firstpage
193
Lastpage
196
Abstract
This paper proposes a novel method for identifying the same records across multiple databases in different languages. In order to identify the same records, we calculate the similarities between records by comparing the text values of metadata elements. The proposed method, i.e. finding the same records across multiple databases, will help users to know which organization has a certain record and its customized versions regardless of languages and differences in formats. Although the proposed approach was demonstrated on Japanese Ukiyo-e databases, it might be applicable to other disciplines for bridging the gaps between databases in different languages.
Keywords
meta data; natural language processing; text analysis; visual databases; Japanese Ukiyo-e image database; languages; metadata elements; text values; textual data; Art; Couplings; Databases; Educational institutions; Libraries; Measurement; Organizations; Japanese arts; de-duplication; digital library; humanities databases; multilingual record linkage;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location
London
Type
conf
DOI
10.1109/JCDL.2014.6970167
Filename
6970167
Link To Document