• DocumentCode
    3712437
  • Title

    Supporting exploratory data analysis with live programming

  • Author

    Robert DeLine;Danyel Fisher

  • Author_Institution
    Microsoft Research, Redmond, WA, USA
  • fYear
    2015
  • Firstpage
    111
  • Lastpage
    119
  • Abstract
    Data scientists often conduct exploratory data analysis in scripting environments with a read-eval-print loop (REPL), like R, IPython or MATLAB. This user experience requires diligent management of execution and generates lengthy histories of unwanted command responses. This paper explores the alternative of live programming, a user experience in which the user's edits immediately and automatically update the script results, a “ripple” effect familiar from spreadsheets. Which user experience provides better support for exploratory data analysis, REPL or ripple? We conducted a controlled lab study with 15 data-experienced professionals. Each participant explored four datasets, two in each experience. The REPL sessions left histories with both significantly more data results and significantly more errors than the live sessions. However, both experiences produced comparable numbers of data results that participants self-rated as insightful. Participants largely preferred the live experience for its responsiveness and ability to keep the script content clean, but missed the visible history that a REPL provides.
  • Keywords
    Reliability
  • Publisher
    IEEE
  • Conference_Titel
    Visual Languages and Human-Centric Computing (VL/HCC), 2015 IEEE Symposium on
  • Type

    conf

  • DOI
    10.1109/VLHCC.2015.7357205
  • Filename
    7357205