• DocumentCode
    172573
  • Title

    Hybrid approach for aligning parallel sentences for languages without a written form using standard Malay and Malay dialects

  • Author

    Khaw, Yen-Min Jasmina ; Tien-Ping Tan

  • Author_Institution
    Sch. of Comput. Sci., Univ. Sains Malaysia, Minden, Malaysia
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    170
  • Lastpage
    174
  • Abstract
    Alignment of parallel text is a step for building a machine translation. Parallel text alignment is important because linguistic information can be retrieved from the result of alignment which including bilingual dictionaries and grammars correspondence of each language. In this paper, we propose a hybrid approach for standard Malay-dialectal Malay parallel text alignment. The Malay dialects in Malaysia can be grouped according to the states such as Perak dialect, Kedah dialect and Terengganu dialect. It is important to learn Malay dialects as it is still flourished and widely used in many areas especially for unofficial matters. Kelantanese Malay is used as an example for dialectal Malay in this research. The obtained precision and recall values of the proposed alignment methods are above 90%.
  • Keywords
    dictionaries; grammars; language translation; linguistics; natural language processing; text analysis; Kedah dialect; Kelantanese Malay; Malaysia; Perak dialect; Terengganu dialect; bilingual dictionaries; grammars; hybrid approach; linguistic information; machine translation; parallel sentences alignment; standard Malay-dialectal Malay parallel text alignment; Dictionaries; Pragmatics; Probability; Reliability; Speech; Standards; Vocabulary; Kelantanese Malay; linguistic information; parallel text alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2014 International Conference on
  • Conference_Location
    Kuching
  • Type

    conf

  • DOI
    10.1109/IALP.2014.6973524
  • Filename
    6973524