• DocumentCode
    1791790
  • Title

    A computational pipeline for crowdsourced transcriptions of Ancient Greek papyrus fragments

  • Author

    Williams, Alex C. ; Wallin, John F. ; Haoyu Yu ; Perale, Marco ; Carroll, Hyrum D. ; Lamblin, Anne-Francoise ; Fortson, Lucy ; Obbink, Dirk ; Lintott, Chris J. ; Brusuelas, James H.

  • Author_Institution
    Middle Tennessee State Univ., Murfreesboro, TN, USA
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    100
  • Lastpage
    105
  • Abstract
    In the late nineteenth century, two excavators from the University of Oxford uncovered a vast trove of naturally deteriorated papyri, numbering over 500,000 fragments, from the city of Oxyrhynchus. With varying levels and forms of deterioration, the identification of a papyrus fragment can become a repetitive, long, and exhausting process for a professional papyrologist. The University of Oxford´s Ancient Lives project aims to accelerate the identification process through citizen science (or crowdsourcing). In the Ancient Lives interface, volunteer users identify letters by clicking on a location in the image to designate the presence of a letter. To date, over 7 million letter identifications from users across the world have been recorded in the Ancient Lives database. In this paper, we present a computational pipeline for converting crowdsourced letter identifications made through the Ancient Lives interface into digital consensus transcriptions of papyrus fragments. We conclude by explaining the usefulness of the pipeline output in the context of additional computational projects that aim to further accelerate the identification process.
  • Keywords
    history; user interfaces; Ancient Lives database; Ancient Lives interface; Ancient Lives project; Oxyrhynchus; University of Oxford; ancient Greek papyrus fragments; citizen science; computational pipeline; crowdsourced letter identifications; crowdsourced transcriptions; identification process; naturally deteriorated papyri; volunteer users; Accuracy; Databases; Digital images; Educational institutions; Kernel; Pipeline processing; Pipelines; big data; crowdsourcing; human computation; papyrus transcription;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004460
  • Filename
    7004460