DocumentCode
3133705
Title
A Study on Automatic Extraction of New Terms
Author
Zhang, Xing ; Fang, Alex Chengyu
Author_Institution
Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Kowloon, China
fYear
2011
fDate
8-9 Oct. 2011
Firstpage
599
Lastpage
602
Abstract
This research explores to automatically predict new terms based on linguistic features and statistical behaviors of noun phrases during a special period. It integrates both syntactic function value and TF-IDF value into an automatic term extraction system to weight new term candidates. Research questions include: what are the linguistic and statistic properties of new terms during a special period? Will linguistic features contribute to prediction of new terms? And will statistic features like, TFIDF Value contribute to prediction of new terms? Correspondingly, a series of experiments are conducted on medical corpus to examine a group of new terms´ distribution properties and syntactic features across two years in comparison. The results show there does exist significant difference between two groups of values. Regardless of this limitation, this research is meaningful as it attempts to realize automation of selection process of new medical terms, which will greatly avoid subjective decisions and reduce experts´ workloads.
Keywords
information retrieval; statistical analysis; TF-IDF value; automatic new term extraction system; linguistic feature; noun phrase behavior; statistical behavior; syntactic function value; Abstracts; Analysis of variance; Equations; Pragmatics; Syntactics; Terminology; Testing; New term; SF-Value; TFIDF; old term; term extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Knowledge Acquisition and Modeling (KAM), 2011 Fourth International Symposium on
Conference_Location
Sanya
Print_ISBN
978-1-4577-1788-8
Type
conf
DOI
10.1109/KAM.2011.162
Filename
6137717
Link To Document