DocumentCode
168448
Title
An approach to named entity extraction from historical documents in traditional Mongolian script
Author
Batjargal, Biligsaikhan ; Khaltarkhuu, Garmaabazar ; Kimura, Fumitaka ; Maeda, Atsushi
Author_Institution
Kinugasa Res. Organ., Ritsumeikan Univ., Kyoto, Japan
fYear
2014
fDate
8-12 Sept. 2014
Firstpage
489
Lastpage
490
Abstract
In this poster, we propose an information extraction method for digitized ancient Mongolian documents that uses an ancient-modern dictionary. Named entities such as historical figures and place names will be extracted with text mining techniques, with the aim of reducing the labor-intensive annotation of historical text. The Text Encoding Initiative (TEI) guidelines will be applied to digital text representations that encode the historical figures and place names along with their interpretations and commentaries.
Keywords
data mining; history; information retrieval; text analysis; TEI guidelines; Text Encoding Initiative; ancient-modern dictionary; digital text representations; digitized ancient Mongolian documents; historical documents; information extraction method; named entity extraction; text mining techniques; traditional Mongolian script; Dictionaries; Educational institutions; Libraries; Visualization; digital library
fLanguage
English
Publisher
IEEE
Conference_Titel
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location
London
Type
conf
DOI
10.1109/JCDL.2014.6970239
Filename
6970239