• DocumentCode
    3133705
  • Title

    A Study on Automatic Extraction of New Terms

  • Author

    Zhang, Xing ; Fang, Alex Chengyu

  • Author_Institution
    Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Kowloon, China
  • fYear
    2011
  • fDate
    8-9 Oct. 2011
  • Firstpage
    599
  • Lastpage
    602
  • Abstract
    This research explores to automatically predict new terms based on linguistic features and statistical behaviors of noun phrases during a special period. It integrates both syntactic function value and TF-IDF value into an automatic term extraction system to weight new term candidates. Research questions include: what are the linguistic and statistic properties of new terms during a special period? Will linguistic features contribute to prediction of new terms? And will statistic features like, TFIDF Value contribute to prediction of new terms? Correspondingly, a series of experiments are conducted on medical corpus to examine a group of new terms´ distribution properties and syntactic features across two years in comparison. The results show there does exist significant difference between two groups of values. Regardless of this limitation, this research is meaningful as it attempts to realize automation of selection process of new medical terms, which will greatly avoid subjective decisions and reduce experts´ workloads.
  • Keywords
    information retrieval; statistical analysis; TF-IDF value; automatic new term extraction system; linguistic feature; noun phrase behavior; statistical behavior; syntactic function value; Abstracts; Analysis of variance; Equations; Pragmatics; Syntactics; Terminology; Testing; New term; SF-Value; TFIDF; old term; term extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge Acquisition and Modeling (KAM), 2011 Fourth International Symposium on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4577-1788-8
  • Type

    conf

  • DOI
    10.1109/KAM.2011.162
  • Filename
    6137717