Title :
Exploiting ontology for retrieving data behind searchable web forms
Author :
El-desoky, A.I. ; Abd El-Gwad, A.O. ; Okasha, M.E.
Author_Institution :
Dept. of Comput. & Syst., Mansoura Univ., Mansoura
Abstract :
With the virtually unlimited amount of information sources, search engines cannot find or index a large part of these information because they are located behind HTML forms. That part of Web are usually known as hidden Web or deep Web and because the traditional crawlers lack the suitable technique to past HTML forms, many hidden Web crawlers try to beat the problem of retrieving data behind forms. This paper addresses the challenges of form processing and response analysis, through introducing a new technique for automatically fill the searchable forms with using the suitable ontology, then automatically generate and submit the queries and finally analyze the response pages. This technique enhances the performance of hidden Web crawlers in terms of precision, recall, time and cost. By experimenting the proposed technique over the real Web, the results was very promising.
Keywords :
Internet; ontologies (artificial intelligence); query processing; search engines; HTML form; data retrieval; deep Web; hidden Web crawlers; information sources; ontology; search engines; searchable Web form; Information retrieval; Ontologies; Web searching; automatic query generation; deep web; hidden web crawler; information retrieval(IR);
Conference_Titel :
Networking and Media Convergence, 2009. ICNM 2009. International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-3776-4
Electronic_ISBN :
978-1-4244-3778-8
DOI :
10.1109/ICNM.2009.4907197