DocumentCode :
2816622
Title :
Confidence on approximate query in large datasets
Author :
Ford, Charles Wesley ; Chiang, Chia-Chu ; Wu, Hao ; Chilka, Radhika R. ; Talburt, John
Author_Institution :
Dept. of Comput. Sci., Arkansas Univ., Little Rock, AR, USA
Volume :
2
fYear :
2004
fDate :
5-7 April 2004
Firstpage :
480
Abstract :
The evolution of the World Wide Web has brought us enormous amounts of information for business and research use. Design and implementation of an automated system for Web data mining has become important for companies wishing to utilize useful information from the Web. We attempt to describe confidence on approximate queries on large datasets, which is done in the context of an automated system for Web data mining. The system has been designed to identify, extract, filter, and analyze data from Web resources. An approach to evaluating the quality of extracted Web data is also discussed. This is an exploratory study of Web data retrieval and Web data analysis.
Keywords :
Internet; Web sites; data analysis; data mining; information filters; query processing; very large databases; Internet; Web data analysis; Web data extraction; Web data filtering; Web data identification; Web data mining; World Wide Web; approximate query confidence; automated system design; large datasets; Application software; Data analysis; Data mining; Databases; Information filtering; Information filters; Information retrieval; Search engines; Statistics; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN :
0-7695-2108-8
Type :
conf
DOI :
10.1109/ITCC.2004.1286700
Filename :
1286700
Link To Document :
بازگشت