• DocumentCode
    172570
  • Title

    Discovering linguistic knowledge by converting printed dictionaries of minority languages into machine readable dictionaries

  • Author

    Ranaivo-Malancon, Bali ; Saee, Suhaila ; Wilfred Busu, Jennifer Fiona

  • Author_Institution
    Fac. of Comput. Sci. & Inf. Technol., Univ. Malaysia Sarawak, Kota Samarahan, Malaysia
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    140
  • Lastpage
    143
  • Abstract
    The goal of the project presented in this paper is to explore the linguistic knowledge hidden in printed dictionaries of minority languages. First, the printed dictionary has to be converted into a machine readable dictionary. Second, existing language processing tools are used to discover the hidden knowledge. To illustrate the proposed idea, a version of an English-Penan dictionary is used as a case study. It appears that even with a small amount of data, some interesting information can be discovered, such as the first list of functional words, some collocations, and insight into the morphological structure of the Penan language.
  • Keywords
    dictionaries; linguistics; natural language processing; English-Penan dictionary; language processing tools; linguistic knowledge discovery; machine readable dictionary; minority languages printed dictionaries; Dictionaries; Manuals; Microstructure; Natural language processing; Optical character recognition software; Pragmatics; Robustness; Penan language; minority languages; printed dictionary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Asian Language Processing (IALP), 2014 International Conference on
  • Conference_Location
    Kuching
  • Type

    conf

  • DOI
    10.1109/IALP.2014.6973522
  • Filename
    6973522