• DocumentCode
    3717316
  • Title

    Plot arceology: A vector-space model of narrative structure

  • Author

    Benjamin M. Schmidt

  • Author_Institution
    Department of History
  • fYear
    2015
  • Firstpage
    1667
  • Lastpage
    1672
  • Abstract
    A novel and important corpus of about 80,000 television and movie scripts from opensubtitles.com shows interesting large-scale patterns of narration in their vocabulary use. These patterns are interesting at the token level but not easily amenable for large scale data analysis. This paper describes a new method, "plot arcs," for describing and comparing structural elements of structure, including plot, across large textual corpora by treating texts as paths through a multidimensional space derived from a topic model. Plot arcs offer a framework for describing the structure of text documents that is easily extensible to a variety of genres and can accommodate many different ideas of plot structure.
  • Keywords
    "TV","Market research","Motion pictures","Animals","Vocabulary","Data analysis","Trajectory"
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/BigData.2015.7363937
  • Filename
    7363937