Regional Information Center for Science and Technology - Automatic Web Query Classification Using Large Unlabeled Web Pages

DocumentCode :

2548420

Title :

Automatic Web Query Classification Using Large Unlabeled Web Pages

Author :

Jingbo Yu ; Na Ye

Author_Institution :

Beijing Inst. of Graphic Commun., Beijing

fYear :

2008

fDate :

20-22 July 2008

Firstpage :

211

Lastpage :

215

Abstract :

This paper presents a novel and simple method that automatically constructs a domain knowledge base for query classification from large-scale Web pages. In addition, using context as the feature of words, a resource of relevant words is built automatically to extend the user's query. Combining the domain knowledge base with query extension by relevant words yields satisfactory query classification performance. Experimental results show that our method achieves a precision of 77.68% and a recall of 75.34% in Chinese query classification. In the English experiments, despite the scarcity of English Web pages and the absence of stemming, precision reaches 58.83% and recall reaches 54.13%, a great improvement over state-of-the-art query classification algorithms.
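
The abstract names three ingredients: words represented by their contexts, a resource of relevant words used to extend the query, and a domain knowledge base used for classification. The record does not give the paper's actual algorithm, so the following Python is only a minimal sketch of that general idea under assumptions of my own: co-occurrence windows as context features, cosine similarity for relevance, and term-overlap scoring against the knowledge base. All function names, the window size, and the toy data are hypothetical.

```python
from collections import Counter
from math import sqrt


def context_vectors(sentences, window=2):
    """Map each word to a bag of the words seen within `window` positions of it."""
    vectors = {}
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            context = tokens[lo:i] + tokens[i + 1:hi]
            vectors.setdefault(word, Counter()).update(context)
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def relevant_words(word, vectors, top_n=2):
    """Return up to `top_n` words whose contexts are most similar to `word`'s."""
    if word not in vectors:
        return []
    scored = [(cosine(vectors[word], vec), other)
              for other, vec in vectors.items() if other != word]
    scored.sort(reverse=True)
    return [other for score, other in scored[:top_n] if score > 0]


def classify(query, vectors, knowledge_base, top_n=2):
    """Extend the query with relevant words, then pick the domain with most term hits."""
    terms = list(query)
    for word in query:
        terms.extend(relevant_words(word, vectors, top_n))
    scores = {domain: sum(t in vocab for t in terms)
              for domain, vocab in knowledge_base.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


# Toy stand-ins for the unlabeled Web pages and the domain knowledge base (assumed data).
pages = [
    ["cheap", "laptop", "price"],
    ["cheap", "notebook", "price"],
    ["football", "match", "score"],
    ["soccer", "match", "score"],
]
knowledge_base = {
    "computers": {"laptop", "price"},
    "sports": {"football", "match", "score"},
}

vectors = context_vectors(pages)
print(classify(["notebook"], vectors, knowledge_base))
```

In this toy setup the query term "notebook" is absent from the knowledge base, so any classification it receives comes from the words added by the extension step, which is the effect the abstract attributes to relevant words.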

Keywords :

Internet; classification; data mining; knowledge based systems; query processing; automatic Web query classification; automatic domain knowledge base construction; large-scale unlabeled Web page; Information management; Web pages; Relevant Word; query classification; the domain knowledge base

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Web-Age Information Management, 2008. WAIM '08. The Ninth International Conference on

Conference_Location :

Zhangjiajie, Hunan

Print_ISBN :

978-0-7695-3185-4

Electronic_ISBN :

978-0-7695-3185-4

Type :

conf

DOI :

10.1109/WAIM.2008.91

Filename :

4597016

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2548420