• DocumentCode
    105383
  • Title

    How Was Your Day? Evaluating a Conversational Companion

  • Author

    Benyon, David ; Gamback, Bjorn ; Hansen, Paul ; Mival, Oli ; Webb, Nick

  • Author_Institution
    Centre for Interaction Design, Edinburgh Napier Univ., Edinburgh, UK
  • Volume
    4
  • Issue
    3
  • fYear
    2013
  • fDate
    July-Sept. 2013
  • Firstpage
    299
  • Lastpage
    311
  • Abstract
    The "How Was Your Day" (HWYD) companion is an embodied conversational agent that can discuss work-related issues, entering free-form dialogues while discussing issues surrounding a typical work day. The open-ended nature of these interactions requires new models of evaluation. Here, we describe a paradigm and methodology for evaluating the main aspects of such functionality in conjunction with overall system behavior, with respect to three parameters: functional ability (i.e., does it do the "rightâ thing conversationally), content (i.e., does it respond appropriately to the semantic context), and emotional behavior (i.e., given the emotional input from the user, does it respond in an emotionally appropriate way). We demonstrate the functionality of our evaluation paradigm as a method for both grading current system performance, and targeting areas for particular performance review. We show correlation between, for example, automatic speech recognition performance and overall system performance (as is expected in systems of this type), but beyond this, we show where individual utterances or responses, indicated as positive or negative, characterize system performance, and demonstrate how our combination evaluation approach highlights issues (both positive and negative) in the companion system\´s interaction behavior.
  • Keywords
    human computer interaction; interactive systems; natural language interfaces; HWYD companion; automatic speech recognition performance; companion system interaction behavior; content behavior; conversational companion evaluation paradigm; embodied conversational agent; emotional behavior; free-form dialogues; functional ability; how was your day companion; overall system performance; system behavior; work-related issues; Context; Educational institutions; Measurement; Prototypes; Speech; Speech recognition; Training; Companions; appropriateness of dialogue; embodied conversational agents; evaluation;
  • fLanguage
    English
  • Journal_Title
    Affective Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1949-3045
  • Type

    jour

  • DOI
    10.1109/T-AFFC.2013.15
  • Filename
    6532284