• DocumentCode
    2376037
  • Title

    Finding comparable temporal categorical records: A similarity measure with an interactive visualization

  • Author

    Wongsuphasawat, Krist ; Shneiderman, Ben

  • Author_Institution
    Dept. of Comput. Sci. & Human-Comput. Interaction Lab., Univ. of Maryland, College Park, MD, USA
  • fYear
    2009
  • fDate
    12-13 Oct. 2009
  • Firstpage
    27
  • Lastpage
    34
  • Abstract
    An increasing number of temporal categorical databases are being collected: Electronic Health Records in healthcare organizations, traffic incident logs in transportation systems, or student records in universities. Finding similar records within these large databases requires effective similarity measures that capture the searcher´s intent. Many similarity measures exist for numerical time series, but temporal categorical records are different. We propose a temporal categorical similarity measure, the M&M (Match & Mismatch) measure, which is based on the concept of aligning records by sentinel events, then matching events between the target and the compared records. The M&M measure combines the time differences between pairs of events and the number of mismatches. To accom-modate customization of parameters in the M&M measure and results interpretation, we implemented Similan, an interactive search and visualization tool for temporal categorical records. A usability study with 8 participants demonstrated that Similan was easy to learn and enabled them to find similar records, but users had difficulty understanding the M&M measure. The usability study feedback, led to an improved version with a continuous timeline, which was tested in a pilot study with 5 participants.
  • Keywords
    data visualisation; information retrieval; interactive systems; temporal databases; time series; very large databases; M&M measure; Match & Mismatch measure; Similan; interactive search tool; interactive visualization tool; large databases; numerical time series; parameters customization; similarity measure; temporal categorical databases; temporal categorical records; Educational institutions; Feedback; Medical services; Particle measurements; Testing; Time measurement; Transportation; Usability; Visual databases; Visualization; M&M Measure; Similan; Similarity Search; Temporal Categorical Records;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Visual Analytics Science and Technology, 2009. VAST 2009. IEEE Symposium on
  • Conference_Location
    Atlantic City, NJ
  • Print_ISBN
    978-1-4244-5283-5
  • Type

    conf

  • DOI
    10.1109/VAST.2009.5332595
  • Filename
    5332595