DocumentCode :
2871947
Title :
Divide and Translate Legal Text Sentence by Using Its Logical Structure
Author :
Bui Thanh Hung ; Nguyen Le Minh ; Shimazu, A.
Author_Institution :
Grad. Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Nomi, Japan
fYear :
2012
fDate :
8-10 Nov. 2012
Firstpage :
18
Lastpage :
23
Abstract :
Translating legal text is generally considered to be difficult because legal text has some characteristics that make it different from other daily-use documents and legal text is usually long and complicated. In order boost the legal text translation quality, splitting an input sentence becomes mandatory. In this paper, we propose a novel method based on the logical structure of legal text sentence for dividing and translating legal text. We use a statistical learning method-Conditional Random Fields (CRFs) with rich linguistic information to recognize the logical structure of legal text sentence. We adapt the logical structure of legal text sentence to divide the sentence. By doing so, translation quality improves. Our experiments show that our approach can achieve better result for both Japanese-English and English-Japanese legal text translation by BLEU, NIST and TER score.
Keywords :
computational linguistics; language translation; learning (artificial intelligence); linguistics; statistical analysis; text analysis; BLEU; English Japanese legal text translation; Japanese English; NIST; TER score; conditional random fields; input sentence; legal text sentence; legal text translation quality; linguistic information; logical structure; statistical learning; Law; Pensions; Pragmatics; Standards; Text recognition; Training; CRFs; logical structure of legal text sentence; phrase-based machine translation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge, Information and Creativity Support Systems (KICSS), 2012 Seventh International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-4564-4
Type :
conf
DOI :
10.1109/KICSS.2012.19
Filename :
6405605
Link To Document :
بازگشت