DocumentCode :
172570
Title :
Discovering linguistic knowledge by converting printed dictionaries of minority languages into machine readable dictionaries
Author :
Ranaivo-Malancon, Bali ; Saee, Suhaila ; Wilfred Busu, Jennifer Fiona
Author_Institution :
Fac. of Comput. Sci. & Inf. Technol., Univ. Malaysia Sarawak, Kota Samarahan, Malaysia
fYear :
2014
fDate :
20-22 Oct. 2014
Firstpage :
140
Lastpage :
143
Abstract :
The goal of the project presented in this paper is to explore the linguistic knowledge hidden in printed dictionaries of minority languages. Firstly, the printed dictionary has to be converted into a machine readable dictionary. The second step is to make use of existing language processing tools to discover the hidden knowledge. To illustrate the proposed idea, a version of an English-Penan dictionary is used as the case-study. It appears that even with a small amount of data, some interesting information, like the first list of functional words, some collocations, and an insight of the morphological structure of the Penan language can be discovered.
Keywords :
dictionaries; linguistics; natural language processing; English-Penan dictionary; language processing tools; linguistic knowledge discovery; machine readable dictionary; minority languages printed dictionaries; Dictionaries; Manuals; Microstructure; Natural language processing; Optical character recognition software; Pragmatics; Robustness; Penan language; machine readable dictionary; minority languages; printed dictionary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location :
Kuching
Type :
conf
DOI :
10.1109/IALP.2014.6973522
Filename :
6973522
Link To Document :
بازگشت