Title :
Automatic term extraction from Chinese scientific texts
Author :
Zheng, Qinghua ; Luo, Junying ; Liu, Jun
Author_Institution :
MOE KLINNS Lab., Xi´´an Jiaotong Univ., Xi´´an, China
Abstract :
Automatic term extraction is an essential task in information processing and has a very important role in many fields, such as information retrieval, knowledge acquisition. However, existing methods are mostly proposed for English domain terms, so they can not fully adapt to the term extraction from Chinese scientific texts. This paper presents a/ new approach on the analysis of the characteristics of Chinese domain terms. Firstly, we introduce a new feature which we call it “max article time” to distinguish terms from non-terms. Then, we use the classification of terms and the links between different terms to obtain the maximum discrimination of this feature. Meanwhile, our method also combines with linguistic methods. Experiments conducted on two different domains for Chinese term extraction indicate our approach has significant improvement over existing techniques and also verify the relative domain independence of the approach.
Keywords :
information retrieval; text analysis; Chinese scientific texts; English domain terms; automatic term extraction; information processing; linguistic methods; max article time; Data mining; Feature extraction; Mutual information; Noise; Operating systems; Pragmatics; Speech; Chinese domain terminology; automatic term extraction; machine learning;
Conference_Titel :
Computer Supported Cooperative Work in Design (CSCWD), 2011 15th International Conference on
Conference_Location :
Lausanne
Print_ISBN :
978-1-4577-0386-7
DOI :
10.1109/CSCWD.2011.5960199