Title :
Bitext alignment for the English-Ukrainian language pair
Author :
Paliy, Zoryana ; Romanyuk, A.
Author_Institution :
Dept. of Appl. Linguistics, Lviv Polytech. Nat. Univ., Lviv, Ukraine
Abstract :
This paper addresses bitext alignment, the main stage in composing a parallel corpus. Three automatic alignment algorithms are selected and evaluated through their software implementations. The bitexts used for alignment comprise a total of 736 sentence pairs (28,303 words) drawn from English fiction texts and their corresponding translations. The selected bitexts are aligned automatically, and the output is evaluated. The aligners' performance results are summarized and discussed.
Keywords :
natural language processing; text analysis; English fiction texts; English-Ukrainian language pair; automatic alignment algorithms; bitext alignment; parallel corpus composition; Books; CADCAM; Computer aided manufacturing; Databases; Encoding; Error analysis; Humans; Performance evaluation; automatic aligner; bitext; parallel corpus
Conference_Title :
10th International Conference - The Experience of Designing and Application of CAD Systems in Microelectronics, 2009 (CADSM 2009)
Conference_Location :
Lviv-Polyana
Print_ISBN :
978-966-2191-05-9