• DocumentCode
    3630296
  • Title

    Modeling the frequency of phrasal verbs with search engines

  • Author

    Grazyna Chamielec;Dawid Weiss

  • Author_Institution
    SuperMemo, Poznan, Poland
  • fYear
    2008
  • Firstpage
    381
  • Lastpage
    388
  • Abstract
    There are well over a thousand phrasal verbs in English. For non-native speakers they are notoriously difficult to remember and use in the right context. We tried to construct a ranking of phrasal verbs according to their estimated occurrence frequency, based on quantitative information available from the public indexable Web. Technically, we used major Web search engines to acquire phrase-occurrence statistics, measured consistency between the rankings implied by their results and confirmed that a rough set of ‘classes’ of phrasal verbs can be distinguished. While this technique relies on inaccurate and possibly biased estimation functions, we show that the overall distribution of ranks seems to be consistent among all the queried search engines operated by different vendors.
  • Keywords
    "Search engines","Frequency estimation","Dictionaries","Computer science","Books","Frequency measurement","Natural languages","Information technology","Web search","Statistical distributions"
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
  • Print_ISBN
    978-83-60810-14-9
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2008.4747269
  • Filename
    4747269