DocumentCode
172573
Title
Hybrid approach for aligning parallel sentences for languages without a written form using standard Malay and Malay dialects
Author
Khaw, Yen-Min Jasmina ; Tan, Tien-Ping
Author_Institution
Sch. of Comput. Sci., Univ. Sains Malaysia, Minden, Malaysia
fYear
2014
fDate
20-22 Oct. 2014
Firstpage
170
Lastpage
174
Abstract
Alignment of parallel text is a step in building a machine translation system. Parallel text alignment is important because linguistic information, including bilingual dictionaries and the grammatical correspondences between the languages, can be retrieved from the alignment result. In this paper, we propose a hybrid approach for standard Malay-dialectal Malay parallel text alignment. The Malay dialects in Malaysia can be grouped by state, for example the Perak, Kedah and Terengganu dialects. It is important to learn Malay dialects because they are still flourishing and widely used in many areas, especially for unofficial matters. Kelantanese Malay is used as the example of dialectal Malay in this research. The precision and recall obtained by the proposed alignment methods are above 90%.
Keywords
dictionaries; grammars; language translation; linguistics; natural language processing; text analysis; Kedah dialect; Kelantanese Malay; Malaysia; Perak dialect; Terengganu dialect; bilingual dictionaries; hybrid approach; linguistic information; machine translation; parallel sentences alignment; standard Malay-dialectal Malay parallel text alignment; Pragmatics; Probability; Reliability; Speech; Standards; Vocabulary; parallel text alignment
fLanguage
English
Publisher
IEEE
Conference_Titel
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location
Kuching
Type
conf
DOI
10.1109/IALP.2014.6973524
Filename
6973524