• DocumentCode
    589199
  • Title

    Detecting Web Robots Using Resource Request Patterns

  • Author

    Doran, Derek ; Gokhale, S.S.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
  • Volume
    1
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Firstpage
    7
  • Lastpage
    12
  • Abstract
    A significant proportion of Web traffic is now attributed to Web robots, and this proportion is likely to grow over time. These robots may threaten the security, privacy, functionality, and performance of a Web server due to their unregulated crawling behavior. Therefore, to assess their impact, it must be possible to accurately detect Web robot requests. Contemporary detection approaches, however, may cease to be effective as the behavior of both robots and humans evolves. In this paper, we present a novel detection approach that is based on the contrasts in the resource request patterns of robots and humans. The proposed scheme, which relies on an invariant behavioral difference between humans and robots, builds on the lessons from contemporary approaches. We demonstrate that the proposed approach can accurately detect Web robots and argue that it is expected to remain effective even as they continue their rapid evolution.
  • Keywords
    Internet; computer network security; Web robots; Web server; Web traffic; crawling behavior; invariant behavioral difference; resource request pattern; Browsers; Detectors; Humans; Measurement; Robots; Training; Web servers; Detection; User Classification; Web Log Analysis; Web Mining; Web crawler; Web robot;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2012 11th International Conference on
  • Conference_Location
    Boca Raton, FL
  • Print_ISBN
    978-1-4673-4651-1
  • Type

    conf

  • DOI
    10.1109/ICMLA.2012.11
  • Filename
    6406581