• DocumentCode
    3122091
  • Title

    Contextual Ranking of Keywords Using Click Data

  • Author

    Irmak, Utku ; von Brzeski, V. ; Kraft, Reiner

  • Author_Institution
    Yahoo! Inc., Sunnyvale, CA
  • fYear
    2009
  • fDate
    March 29 2009-April 2 2009
  • Firstpage
    457
  • Lastpage
    468
  • Abstract
    The problem of automatically extracting the most interesting and relevant keyword phrases in a document has been studied extensively as it is crucial for a number of applications. These applications include contextual advertising, automatic text summarization, and user-centric entity detection systems. All these applications can potentially benefit from a successful solution as it enables computational efficiency (by decreasing the input size), noise reduction, or overall improved user satisfaction.In this paper, we study this problem and focus on improving the overall quality of user-centric entity detection systems. First, we review our concept extraction technique, which relies on search engine query logs. We then define a new feature space to represent the interesting ness of concepts, and describe a new approach to estimate their relevancy for a given context. We utilize click through data obtained from a large scale user-centric entity detection system - Contextual Shortcuts - to train a model to rank the extracted concepts, and evaluate the resulting model extensively again based on their click through data. Our results show that the learned model outperforms the baseline model, which employs similar features but whose weights are tuned carefully based on empirical observations, and reduces the error rate from 30.22% to 18.66%.
  • Keywords
    information retrieval; search engines; text analysis; automatic text summarization; click data; concept extraction technique; contextual advertising; contextual keyword ranking; keyword phrase extraction; search engine query logs; user-centric entity detection systems; Advertising; Computational efficiency; Context modeling; Data engineering; Data mining; Noise reduction; Search engines; USA Councils; Uniform resource locators; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    1084-4627
  • Print_ISBN
    978-1-4244-3422-0
  • Electronic_ISBN
    1084-4627
  • Type

    conf

  • DOI
    10.1109/ICDE.2009.76
  • Filename
    4812426