• DocumentCode
    168448
  • Title

    An approach to named entity extraction from historical documents in traditional mongolian script

  • Author

    Batjargal, Biligsaikhan ; Khaltarkhuu, Garmaabazar ; Kimura, Fumitaka ; Maeda, Atsushi

  • Author_Institution
    Kinugasa Res. Organ., Ritsumeikan Univ., Kyoto, Japan
  • fYear
    2014
  • fDate
    8-12 Sept. 2014
  • Firstpage
    489
  • Lastpage
    490
  • Abstract
    In this poster, we propose an information extraction method for digitized ancient Mongolian documents by utilizing an ancient-modern dictionary. Named entities such as historical figures and place names will be extracted by employing text mining techniques that aim to reduce the labor-intensive annotation on historical text. The Text Encoding Initiative (TEI) guidelines will be applied to digital text representations that encode the historical figures and place names along with their interpretations, and commentaries.
  • Keywords
    data mining; history; information retrieval; text analysis; TEI guidelines; Text Encoding Initiative; ancient-modern dictionary; digital text representations; digitized ancient Mongolian documents; historical documents; information extraction method; named entity extraction; text mining techniques; traditional Mongolian script; Dictionaries; Educational institutions; Libraries; Text analysis; Visualization; digital library; historical documents; named entity extraction; traditional Mongolian script;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1109/JCDL.2014.6970239
  • Filename
    6970239