• DocumentCode
    3316840
  • Title

    Improving collocation extraction by using syntactic patterns

  • Author

    Xu, Ruifeng ; Lu, Qin

  • Author_Institution
    Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, China
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    52
  • Lastpage
    57
  • Abstract
    A study on using syntactic patterns to improve window-based collocation extraction systems is presented. The support collocation patterns and reject collocation patterns retrieved from a chunked corpus and are used in two different strategies. The first strategy uses only the support patterns in preprocessing stage whereas the second strategy incorporates both the support and the reject patterns into the existing systems. Experimental results show that the use of syntactic patterns can significantly improve the performance of collocation extraction especially for filtering out pseudo collocations and the extraction of low-occurrence collocations.
  • Keywords
    computational linguistics; feature extraction; natural languages; pattern recognition; low-occurrence collocation; pseudo collocation; syntactic patterns; window-based collocation extraction system; Data mining; Frequency; Magnetic heads; Natural language processing; Natural languages; Statistical analysis; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type

    conf

  • DOI
    10.1109/NLPKE.2005.1598706
  • Filename
    1598706