Title :
Subject-Oriented Classification Based on Scale Probing in the Deep Web
Author :
Nie, Tiezheng ; Shen, Derong ; Yu, Ge ; Kou, Yue
Abstract :
To access the large-scale data sources efficiently and automatically, it is necessary to classify these data sources into different domains and categories. In this paper, we propose a novel classification approach to classify data sources into detail domain subjects by query probing. In our approach, we train sample instances for each subject category and use them to probe the data scale of each source and category. And then we build a matrix to classify a data source into one or more subject categories and develop a decision algorithm based on probing iteration to rectify the classification result. Our experiments over real deep web sources show that our approach can achieve higher accuracy across a variety of data sources.
Keywords :
Internet; database management systems; query processing; decision algorithm; large-scale data sources; scale probing; subject category; subject-oriented classification; Books; Computer science; Data mining; Databases; Information management; Large-scale systems; Motion pictures; Probes; Proposals; Web pages; classification; deep web; probing; subject-oriented;
Conference_Titel :
Web-Age Information Management, 2008. WAIM '08. The Ninth International Conference on
Conference_Location :
Zhangjiajie Hunan
Print_ISBN :
978-0-7695-3185-4
Electronic_ISBN :
978-0-7695-3185-4
DOI :
10.1109/WAIM.2008.85