DocumentCode :
3248222
Title :
Auto-POS templates and mixed metrics for recognizing terms in scientific literature
Author :
You, Hongliang ; Zhang, Wei ; Shen, Junyi ; Yu, Yang ; Liu, Ting
Author_Institution :
Sch. of Electr. & Inf. Eng., Xi´´an Jiaotong Univ., Xi´´an, China
fYear :
2010
fDate :
20-21 Oct. 2010
Firstpage :
84
Lastpage :
87
Abstract :
Automatic Term Recognition (ATR) is an important task for Knowledge Acquisition, which aims at acquiring formalized words which are not recorded in time in the glossary. In recent years, several statistical methods has proved to be effective, and emerging methods such as C-value, NC-Value, TermExtractor has shown great advantages on this task. However, few works have been done on the Metric mixing algorithm that combines those metrics as a whole. In this paper, we first collect part-of-speech templates from already-known terms automatically, namely Auto-POS templates, instead of artificial regular expressions, and then we match them with POS strings to acquire candidate terms. Finally we sort those candidates by metric mixing algorithm. Experimental results on IEEE2006-2007 metadata show that the metric mixing algorithm performs better than any separate metrics alone.
Keywords :
knowledge acquisition; meta data; scientific information systems; speech recognition; text analysis; Auto-POS templates; IEEE2006-2007 metadata; POS strings; TermExtractor; artificial regular expressions; auto-POS templates; automatic term recognition; knowledge acquisition; metric mixing algorithm; part-of-speech templates; scientific literature; Artificial neural networks; Logic gates; Measurement; Terminology; Automatic Term Recognition; Information Extraction; Knowledge Acquisition; Text Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge Acquisition and Modeling (KAM), 2010 3rd International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-8004-3
Type :
conf
DOI :
10.1109/KAM.2010.5646319
Filename :
5646319
Link To Document :
بازگشت