DocumentCode
2697468
Title
Deactivation of Unwelcomed Deep Web Extraction Services through Random Injection
Author
Bhagwan, Varun ; Grandison, Tyrone
Author_Institution
IBM Almaden Res. Center, San Jose, CA, USA
fYear
2009
fDate
6-10 July 2009
Firstpage
1014
Lastpage
1015
Abstract
Web sites serve content both through Web services and through user-viewable Web pages. While the consumers of Web services are typically 'machines', Web pages are meant for human users. For reasons of security, revenue, ownership, availability, etc., service providers strongly prefer that content destined for further processing be fetched in a prescribed fashion, preferably through a supplied Web service. In fact, monetization of partnerships within a services ecosystem normally means that Web site data translates into valuable revenue. Unfortunately, it is quite commonplace for arbitrary developers to extract or leverage information from Web sites without asking for permission and/or negotiating a revenue-sharing agreement. This may translate into significant lost income for content providers. Even in cases where Web site owners are happy to share the data, they may want users to adopt dedicated Web service APIs (and associated API servers) rather than putting a load on their revenue-generating Web sites. In this paper, we introduce a mechanism that disables automated Web scraping agents, thus forcing clients to conform to the provided Web services.
Keywords
Web services; Web sites; application program interfaces; financial management; information retrieval; multi-agent systems; API; application program interface; automated Web scraping agents; random injection; revenue-generating Web sites; unwelcomed deep Web extraction services; user-viewable Web pages; Availability; Data mining; Data security; HTML; Humans; Navigation; Switches; USA Councils; Web server; Deep Web Extraction; Random Injection
fLanguage
English
Publisher
IEEE
Conference_Titel
Web Services, 2009. ICWS 2009. IEEE International Conference on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3709-2
Type
conf
DOI
10.1109/ICWS.2009.130
Filename
5175930