• DocumentCode
    140758
  • Title

    Incremental discovery of prominent situational facts

  • Author

    Sultana, Ayesha ; Hassan, Norfaeza ; Chengkai Li ; Jun Yang ; Cong Yu

  • Author_Institution
    Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    112
  • Lastpage
    123
  • Abstract
    We study the novel problem of finding new, prominent situational facts, which are emerging statements about objects that stand out within certain contexts. Many such facts are newsworthy-e.g., an athlete´s outstanding performance in a game, or a viral video´s impressive popularity. Effective and efficient identification of these facts assists journalists in reporting, one of the main goals of computational journalism. Technically, we consider an ever-growing table of objects with dimension and measure attributes. A situational fact is a “contextual” skyline tuple that stands out against historical tuples in a context, specified by a conjunctive constraint involving dimension attributes, when a set of measure attributes are compared. New tuples are constantly added to the table, reflecting events happening in the real world. Our goal is to discover constraint-measure pairs that qualify a new tuple as a contextual skyline tuple, and discover them quickly before the event becomes yesterday´s news. A brute-force approach requires exhaustive comparison with every tuple, under every constraint, and in every measure subspace. We design algorithms in response to these challenges using three corresponding ideas-tuple reduction, constraint pruning, and sharing computation across measure subspaces. We also adopt a simple prominence measure to rank the discovered facts when they are numerous. Experiments over two real datasets validate the effectiveness and efficiency of our techniques.
  • Keywords
    data mining; information retrieval; learning (artificial intelligence); brute-force approach; computational journalism; conjunctive constraint; constraint pruning; constraint-measure pairs; contextual skyline tuple; dimension attributes; fact identification; historical tuple; incremental discovery; measure attributes; prominent situational facts; sharing computation; tuple reduction; Algorithm design and analysis; Context; Databases; Extraterrestrial measurements; Games; Lattices; Media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816644
  • Filename
    6816644