• DocumentCode
    3169363
  • Title

    Reverse Engineering CAPTCHAs

  • Author

    Hindle, Abram ; Godfrey, Michael W. ; Holt, Richard C.

  • Author_Institution
    Software Archit. Group, Univ. of Waterloo, Waterloo, ON
  • fYear
    2008
  • fDate
    15-18 Oct. 2008
  • Firstpage
    59
  • Lastpage
    68
  • Abstract
    CAPTCHAs are automated Turing tests used to determine if the end-user is human and not an automated program. Users are asked to read and answer Visual CAPTCHAs, which often appear as bitmaps of text characters, in order to gain access to a low-cost resource such as webmail or a blog. CAPTCHAs are generated by software and the structure of a CAPTCHA gives hints to its implementation. Thus due to these properties of image processing and image composition, the process that creates CAPTCHAs can often be reverse engineered. Once the implementation strategy of a family of CAPTCHAs has been reverse engineered the CAPTCHA instances may be solved automatically by leveraging weaknesses in the creation process or by comparing a CAPTCHA´s output against itself. In this paper, we present a case study where we reverse engineer and solve real-world CAPTCHAs using simple image processing techniques such as bitmap comparison, thresholding, fill-flood segmentation, dilation, and erosion. We present black-box and white-box methodologies for reverse engineering and solving CAPTCHAs. As well we provide an open source toolkit for solving CAPTCHAs that we have used with a success rates of 99, 95, 61, 30%, and 27% on hundreds of CAPTCHAs from five real-world examples.
  • Keywords
    Turing machines; image segmentation; reverse engineering; CAPTCHA; automated Turing test; bitmap comparison; dilation; erosion; fill-flood segmentation; image processing technique; reverse engineering; thresholding; Artificial intelligence; Automatic testing; Humans; Image processing; Information services; Internet; Reverse engineering; Software architecture; Software testing; Web sites; CAPTCHA; Reverse Engineering; image processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reverse Engineering, 2008. WCRE '08. 15th Working Conference on
  • Conference_Location
    Antwerp
  • ISSN
    1095-1350
  • Print_ISBN
    978-0-7695-3429-9
  • Type

    conf

  • DOI
    10.1109/WCRE.2008.35
  • Filename
    4656394