DocumentCode :
658624
Title :
A Supervised Method to Enhance Vocabulary with the Creation of Domain Specific Lexica
Author :
Fernandes, Paulo ; Furquim, Luis O. C. ; Lopes, Luis
Author_Institution :
Comput. Sci. Dept. - FACIN, PUCRS Univ., Porto Alegre, Brazil
Volume :
3
fYear :
2013
fDate :
17-20 Nov. 2013
Firstpage :
139
Lastpage :
142
Abstract :
This paper proposes a method to enhance lexica by processing domain specific corpora. The proposed method relies on the identification of the more relevant unknown terms in each domain corpus. The innovative points of the proposed approach is to automatically detect unknown terms using MTMDD technology to handle lexical structures, and to automatically rank and identify domain specific terms using gini and tf-dcf indices. The proposed method is experimented in six corpora in order to illustrate its benefits.
Keywords :
computational linguistics; indexing; vocabulary; MTMDD technology; domain corpus; domain specific corpora; domain specific lexica; domain specific terms; enhance vocabulary; gini indix; lexical structures; supervised method; tf-dcf index; Computer science; Context; Dictionaries; Indexes; Ontologies; Standards; Vocabulary; information retrieval; natural language processing; term extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2902-3
Type :
conf
DOI :
10.1109/WI-IAT.2013.168
Filename :
6690713
Link To Document :
بازگشت