DocumentCode
172570
Title
Discovering linguistic knowledge by converting printed dictionaries of minority languages into machine readable dictionaries
Author
Ranaivo-Malancon, Bali ; Saee, Suhaila ; Wilfred Busu, Jennifer Fiona
Author_Institution
Fac. of Comput. Sci. & Inf. Technol., Univ. Malaysia Sarawak, Kota Samarahan, Malaysia
fYear
2014
fDate
20-22 Oct. 2014
Firstpage
140
Lastpage
143
Abstract
The goal of the project presented in this paper is to explore the linguistic knowledge hidden in printed dictionaries of minority languages. First, the printed dictionary is converted into a machine readable dictionary. Second, existing language processing tools are applied to discover the hidden knowledge. To illustrate the proposed idea, a version of an English-Penan dictionary is used as a case study. Even with a small amount of data, some interesting information can be discovered, such as a first list of function words, some collocations, and insight into the morphological structure of the Penan language.
Keywords
dictionaries; linguistics; natural language processing; English-Penan dictionary; language processing tools; linguistic knowledge discovery; machine readable dictionary; minority languages printed dictionaries; Dictionaries; Manuals; Microstructure; Natural language processing; Optical character recognition software; Pragmatics; Robustness; Penan language; machine readable dictionary; minority languages; printed dictionary
fLanguage
English
Publisher
IEEE
Conference_Titel
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location
Kuching
Type
conf
DOI
10.1109/IALP.2014.6973522
Filename
6973522