DocumentCode
168306
Title
Human and machine error analysis on dependency parsing of ancient Greek texts
Author
Majidi, Saeed ; Crane, Gregory
Author_Institution
Dept. of Comput. Sci., Tufts Univ., Medford, MA, USA
fYear
2014
fDate
8-12 Sept. 2014
Firstpage
221
Lastpage
224
Abstract
Automatically generated metadata from large collections is an essential component of digital libraries. It is beginning to emerge as fundamental to the study of languages. Morphosyntactic annotation captures the form of individual words and their function. Nonetheless automated syntactic analysis is still imperfect and human annotators can be significantly more accurate. On the other hand, human work is expensive and even humans find some constructions difficult to annotate correctly. Comparing the performance of human annotators with that of an automatic parser is thus important for exploring how the two methods can best be combined. In the present study, we compare the frequency of the different types of errors made by student annotators with those made by different dependency parsers when annotating ancient Greek. With a few exceptions, the frequency of the different types of errors was similar for human and machine. The significance of these results is briefly discussed.
Keywords
digital libraries; grammars; meta data; natural language processing; text analysis; ancient Greek texts; automated syntactic analysis; automatic parser; automatically generated metadata; dependency parsing; digital libraries; human annotators; human error analysis; machine error analysis; morphosyntactic annotation; Correlation; Erbium; Error analysis; Natural languages; Pragmatics; Radio frequency; Syntactics; Dependency parsing; ancient greek treebanking; corpus annotation; data-driven parsing; error analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location
London
Type
conf
DOI
10.1109/JCDL.2014.6970171
Filename
6970171
Link To Document