Title :
URL ordering based performance evaluation of Web crawler
Author :
Shoaib, Mohammed ; Maurya, Ajay Kumar
Author_Institution :
Fac. of Comput. Sci. & Eng., Shri Ramswaroop Memorial Univ., Lucknow, India
Abstract :
There are billions of Web pages on World Wide Web which can be accessed via internet. All of us rely on usage of internet for source of information. This source of information is available on web in various forms such as Websites, databases, images, sound, videos and many more. The search results given by search engine are classified on basis of many techniques such as keyword matches, link analysis, or many other techniques. Search engines provide information gathered from their own indexed databases. These indexed databases contain downloaded information from web pages. Whenever a query is provided by user, the information is fetched from these indexed pages. The Web Crawler is used to download and store web pages. Web crawler of these search engines is expert in crawling various Web pages to gather huge source of information. Web Crawler is developed which orders URLs on the basis of their content similarity to a query and structural similarity. Results are provided over five parameters: Top URLs, Precision, Content, Structural and Total Similarity for a keyword.
Keywords :
Web sites; database indexing; query processing; search engines; Internet; URL ordering; Web crawler; Web pages; Web sites; World Wide Web; content similarity; database indexing; information source; keyword matches; link analysis; performance evaluation; search engines; structural similarity; top URLs; total similarity; Cloud computing; Distributed databases; Medical services; Patient monitoring; Schedules; URL Ordering; Web Crawler; Web Pages;
Conference_Titel :
Advances in Engineering and Technology Research (ICAETR), 2014 International Conference on
Conference_Location :
Unnao
DOI :
10.1109/ICAETR.2014.7012962