Title :
Hybrid approach for aligning parallel sentences for languages without a written form using standard Malay and Malay dialects
Author :
Khaw, Yen-Min Jasmina ; Tien-Ping Tan
Author_Institution :
Sch. of Comput. Sci., Univ. Sains Malaysia, Minden, Malaysia
Abstract :
Aligning parallel text is a step in building a machine translation system. Parallel text alignment is important because linguistic information, including bilingual dictionaries and grammatical correspondences between the languages, can be retrieved from the alignment result. In this paper, we propose a hybrid approach for standard Malay-dialectal Malay parallel text alignment. The Malay dialects in Malaysia can be grouped by state, for example the Perak, Kedah and Terengganu dialects. Learning Malay dialects is important because they still flourish and are widely used in many areas, especially for unofficial matters. Kelantanese Malay is used as the example of dialectal Malay in this research. The proposed alignment methods achieve precision and recall values above 90%.
Keywords :
dictionaries; grammars; language translation; linguistics; natural language processing; text analysis; Kedah dialect; Kelantanese Malay; Malaysia; Perak dialect; Terengganu dialect; bilingual dictionaries; hybrid approach; linguistic information; machine translation; parallel sentences alignment; standard Malay-dialectal Malay parallel text alignment; Pragmatics; Probability; Reliability; Speech; Standards; Vocabulary; parallel text alignment
Conference_Title :
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location :
Kuching
DOI :
10.1109/IALP.2014.6973524