Title :
Identifying the same records across multiple Ukiyo-e image databases using textual data in different languages
Author :
Batjargal, Biligsaikhan ; Kuyama, Takeo ; Kimura, Fumitaka ; Maeda, Atsushi
Author_Institution :
Kinugasa Res. Organ., Ritsumeikan Univ., Kyoto, Japan
Abstract :
This paper proposes a novel method for identifying the same records across multiple databases in different languages. In order to identify the same records, we calculate the similarities between records by comparing the text values of metadata elements. The proposed method, i.e. finding the same records across multiple databases, will help users to know which organization has a certain record and its customized versions regardless of languages and differences in formats. Although the proposed approach was demonstrated on Japanese Ukiyo-e databases, it might be applicable to other disciplines for bridging the gaps between databases in different languages.
Keywords :
meta data; natural language processing; text analysis; visual databases; Japanese Ukiyo-e image database; languages; metadata elements; text values; textual data; Art; Couplings; Databases; Educational institutions; Libraries; Measurement; Organizations; Japanese arts; de-duplication; digital library; humanities databases; multilingual record linkage;
Conference_Titel :
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location :
London
DOI :
10.1109/JCDL.2014.6970167