• DocumentCode
    168296
  • Title

    Identifying the same records across multiple Ukiyo-e image databases using textual data in different languages

  • Author

    Batjargal, Biligsaikhan ; Kuyama, Takeo ; Kimura, Fumitaka ; Maeda, Atsushi

  • Author_Institution
    Kinugasa Res. Organ., Ritsumeikan Univ., Kyoto, Japan
  • fYear
    2014
  • fDate
    8-12 Sept. 2014
  • Firstpage
    193
  • Lastpage
    196
  • Abstract
    This paper proposes a novel method for identifying the same records across multiple databases in different languages. In order to identify the same records, we calculate the similarities between records by comparing the text values of metadata elements. The proposed method, i.e. finding the same records across multiple databases, will help users to know which organization has a certain record and its customized versions regardless of languages and differences in formats. Although the proposed approach was demonstrated on Japanese Ukiyo-e databases, it might be applicable to other disciplines for bridging the gaps between databases in different languages.
  • Keywords
    meta data; natural language processing; text analysis; visual databases; Japanese Ukiyo-e image database; languages; metadata elements; text values; textual data; Art; Couplings; Databases; Educational institutions; Libraries; Measurement; Organizations; Japanese arts; de-duplication; digital library; humanities databases; multilingual record linkage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1109/JCDL.2014.6970167
  • Filename
    6970167