Statistical Learning in Web Search

Author

Li, Hang

Author_Institution

Microsoft Res. Asia, Beijing

fYear

2008

fDate

26-26 April 2008

Firstpage

3

Lastpage

3

Abstract

Search is becoming the major means for people to access the information on the Internet. According to a survey, 55% of web users use search engines every day. Web search engines are built with technologies mainly from two areas, namely, large-scale distributed computing and statistical learning. Statistical learning is useful because there are many uncertainties in crawling, indexing, ranking, and serving of Web search and the solutions have to be data-driven. In this talk, I will explain how statistical learning technologies are being used in web search. I will also introduce some of the statistical learning technologies for web search, which we have developed recently at MSRA. They include BrowseRrank, ranking refinement, query dependent ranking, and query refinement.

Keywords

Internet; learning (artificial intelligence); search engines; statistical analysis; BrowseRrank; Internet; Web search; data-driven methods; large-scale distributed computing; query dependent ranking; query refinement; ranking refinement; search engines; statistical learning; Asia; Distributed computing; Indexing; Internet; Large-scale systems; Search engines; Statistical learning; Uncertainty; Web search;

fLanguage

English

Publisher

ieee

Conference_Titel

Information-Explosion and Next Generation Search, 2008. INGS '08. International Workshop on

Conference_Location

Shenyang

Print_ISBN

978-0-7695-3300-1

Type

conf

DOI

10.1109/INGS.2008.10

Filename

4627225