DocumentCode :
1849516
Title :
Recognizing logical parts in Vietnamese legal texts using Conditional Random Fields
Author :
Nguyen Truong Son ; Ho Bao Quoc ; Nguyen Thi Phuong Duyen ; Nguyen Le Minh
Author_Institution :
Fac. of Inf. Technol., Univ. of Sci., VNU, Ho Chi Minh City, Vietnam
fYear :
2015
fDate :
25-28 Jan. 2015
Firstpage :
1
Lastpage :
6
Abstract :
Analyzing the structure of legal sentences in legal document is an important phase to build a knowledge management system in Legal Engineering. This paper proposes a new approach to recognize logical parts in Vietnamese legal documents based on a statistic machine learning method - Conditional Random Fields. Beside linguistic features such as word features, part of speech features, we use semantic features of logical parts such as trigger features and ontology features to improve the result of the annotation system. Experiments were conducted in a Vietnamese Business Law data set and obtained 78.12% at precision and 68.72% at recall measure. Compare to state-of-the-art systems, it improves the result for recognizing some logical parts.
Keywords :
knowledge management; law administration; learning (artificial intelligence); ontologies (artificial intelligence); text analysis; Vietnamese business law data set; Vietnamese legal documents; Vietnamese legal texts; annotation system; conditional random fields; linguistic features; logical parts recognition; ontology features; part of speech features; statistic machine learning method; trigger features; word features; Dictionaries; Hidden Markov models; Law; Ontologies; Text recognition; Training; conditional random field; legal text mining; machine learning; named entities recognition; semantic annotation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing & Communication Technologies - Research, Innovation, and Vision for the Future (RIVF), 2015 IEEE RIVF International Conference on
Conference_Location :
Can Tho
Print_ISBN :
978-1-4799-8043-7
Type :
conf
DOI :
10.1109/RIVF.2015.7049865
Filename :
7049865
Link To Document :
بازگشت