• DocumentCode
    2697468
  • Title

    Deactivation of Unwelcomed Deep Web Extraction Services through Random Injection

  • Author

    Bhagwan, Varun ; Grandison, Tyrone

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • fYear
    2009
  • fDate
    6-10 July 2009
  • Firstpage
    1014
  • Lastpage
    1015
  • Abstract
    Web sites serve content both through Web services as well as through user-viewable Web pages. While the consumers of Web-services are typically ´machines´, Web pages are meant for human users. It is highly desirable (for reasons of security, revenue, ownership, availability etc.) for service providers that content that will undergo further processing be fetched in a prescribed fashion, preferably through a supplied Web services. In fact, monetization of partnerships within a services ecosystem normally means that Web site data translate into valuable revenue. Unfortunately, it is quite commonplace for arbitrary developers to extract or leverage information from websites without asking for permission and or negotiating a revenue sharing agreement. This may translate to significant lost income for content providers. Even in cases where Web site owners are happy to share the data, they may want users to adopt dedicated Web service APIs (and associated API-servers) rather than putting a load on their revenue-generating websites. In this paper, we introduce a mechanism that disables automated Web scraping agents, thus forcing clients to conform to the provided Web Services.
  • Keywords
    Web services; Web sites; application program interfaces; financial management; information retrieval; multi-agent systems; API; Web services; application program interface; automated Web scraping agents; random injection; revenue-generating Web sites; unwelcomed deep Web extraction services; user-viewable Web pages; Availability; Data mining; Data security; HTML; Humans; Navigation; Switches; USA Councils; Web server; Web services; Deep Web Extraction; Random Injection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Services, 2009. ICWS 2009. IEEE International Conference on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3709-2
  • Type

    conf

  • DOI
    10.1109/ICWS.2009.130
  • Filename
    5175930