DocumentCode
658624
Title
A Supervised Method to Enhance Vocabulary with the Creation of Domain Specific Lexica
Author
Fernandes, Paulo ; Furquim, Luis O. C. ; Lopes, Luis
Author_Institution
Comput. Sci. Dept. - FACIN, PUCRS Univ., Porto Alegre, Brazil
Volume
3
fYear
2013
fDate
17-20 Nov. 2013
Firstpage
139
Lastpage
142
Abstract
This paper proposes a method to enhance lexica by processing domain specific corpora. The proposed method relies on the identification of the more relevant unknown terms in each domain corpus. The innovative points of the proposed approach is to automatically detect unknown terms using MTMDD technology to handle lexical structures, and to automatically rank and identify domain specific terms using gini and tf-dcf indices. The proposed method is experimented in six corpora in order to illustrate its benefits.
Keywords
computational linguistics; indexing; vocabulary; MTMDD technology; domain corpus; domain specific corpora; domain specific lexica; domain specific terms; enhance vocabulary; gini indix; lexical structures; supervised method; tf-dcf index; Computer science; Context; Dictionaries; Indexes; Ontologies; Standards; Vocabulary; information retrieval; natural language processing; term extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4799-2902-3
Type
conf
DOI
10.1109/WI-IAT.2013.168
Filename
6690713
Link To Document