Title :
Auto-POS templates and mixed metrics for recognizing terms in scientific literature
Author :
You, Hongliang ; Zhang, Wei ; Shen, Junyi ; Yu, Yang ; Liu, Ting
Author_Institution :
Sch. of Electr. & Inf. Eng., Xi´´an Jiaotong Univ., Xi´´an, China
Abstract :
Automatic Term Recognition (ATR) is an important task for Knowledge Acquisition, which aims at acquiring formalized words which are not recorded in time in the glossary. In recent years, several statistical methods has proved to be effective, and emerging methods such as C-value, NC-Value, TermExtractor has shown great advantages on this task. However, few works have been done on the Metric mixing algorithm that combines those metrics as a whole. In this paper, we first collect part-of-speech templates from already-known terms automatically, namely Auto-POS templates, instead of artificial regular expressions, and then we match them with POS strings to acquire candidate terms. Finally we sort those candidates by metric mixing algorithm. Experimental results on IEEE2006-2007 metadata show that the metric mixing algorithm performs better than any separate metrics alone.
Keywords :
knowledge acquisition; meta data; scientific information systems; speech recognition; text analysis; Auto-POS templates; IEEE2006-2007 metadata; POS strings; TermExtractor; artificial regular expressions; auto-POS templates; automatic term recognition; knowledge acquisition; metric mixing algorithm; part-of-speech templates; scientific literature; Artificial neural networks; Logic gates; Measurement; Terminology; Automatic Term Recognition; Information Extraction; Knowledge Acquisition; Text Mining;
Conference_Titel :
Knowledge Acquisition and Modeling (KAM), 2010 3rd International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-8004-3
DOI :
10.1109/KAM.2010.5646319