Title :
An approach to named entity extraction from historical documents in traditional mongolian script
Author :
Batjargal, Biligsaikhan ; Khaltarkhuu, Garmaabazar ; Kimura, Fumitaka ; Maeda, Atsushi
Author_Institution :
Kinugasa Res. Organ., Ritsumeikan Univ., Kyoto, Japan
Abstract :
In this poster, we propose an information extraction method for digitized ancient Mongolian documents by utilizing an ancient-modern dictionary. Named entities such as historical figures and place names will be extracted by employing text mining techniques that aim to reduce the labor-intensive annotation on historical text. The Text Encoding Initiative (TEI) guidelines will be applied to digital text representations that encode the historical figures and place names along with their interpretations, and commentaries.
Keywords :
data mining; history; information retrieval; text analysis; TEI guidelines; Text Encoding Initiative; ancient-modern dictionary; digital text representations; digitized ancient Mongolian documents; historical documents; information extraction method; named entity extraction; text mining techniques; traditional Mongolian script; Dictionaries; Educational institutions; Libraries; Text analysis; Visualization; digital library; historical documents; named entity extraction; traditional Mongolian script;
Conference_Titel :
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location :
London
DOI :
10.1109/JCDL.2014.6970239