Title :
Automatic recognition of Chinese scientific and technological terms using integrated linguistic knowledge
Author :
Sui, Zhifang ; Yirong Chen ; Wei, Zhouchao
Author_Institution :
Inst. of Comput. Linguistics, Peking Univ., Beijing, China
Abstract :
We introduce our research on using integrated linguistic knowledge to automatically recognize Chinese scientific and technological terms based on the careful analysis of the characteristics of this kind of terms. The system of automatic term recognition includes two phases: learning stage and application stage. In the stage of learning, we use a series of machine learning methods to get various kinds of integrated knowledge for automatic term recognition from a large-scale corpus and a term bank. These knowledge includes the inner structural knowledge of terms, the statistical domain features of term component, the statistical mutual information between the components of terms, the outer environment features of terms and the distinct text-level features of term recognition etc.. In the stage of application, through an efficient model, we use all these various types of knowledge into automatic term recognition. The experiments show that the system can give great help to the expert of term standardization to discover new terms.
Keywords :
character recognition; knowledge based systems; learning (artificial intelligence); linguistics; natural languages; automatic Chinese scientific term recognition; integrated linguistic knowledge; machine learning; term standardization; Biological materials; Bluetooth; Bridges; Business; Character recognition; Computational linguistics; Materials science and technology; Natural languages; Paper technology; Standardization;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275948