DocumentCode :
3102158
Title :
Statistical termhood measurement for mono-word terms via corpus comparison
Author :
Liu, Xiao-yue ; Kit, Chunyu
Author_Institution :
Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Kowloon, China
Volume :
6
fYear :
2009
fDate :
12-15 July 2009
Firstpage :
3499
Lastpage :
3504
Abstract :
This paper examines the performance of a number of statistical measures for mono-word termhood within a corpus comparison framework. These measures are defined in terms of the frequency, information, and rank of a term candidate in a domain and a background corpus. The evaluation results from our experiments reveal interesting characteristics of each metric and verify the outstanding performance of those based on enhanced rank and information in identifying true terms.
Keywords :
data mining; information analysis; natural language interfaces; corpus comparison; mono-word terms; statistical termhood measurement; Cybernetics; Data mining; Education; Frequency measurement; Information technology; Knowledge transfer; Machine learning; Natural language processing; Research and development; Terminology; Automatic term recognition; Background corpus; Corpus comparison; Termhood measure
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
Type :
conf
DOI :
10.1109/ICMLC.2009.5212765
Filename :
5212765