• DocumentCode
    730840
  • Title

    An information-theoretic framework for automated discovery of prosodic cues to conversational structure

  • Author

    Laskowski, K. ; Hjalmarsson, A.

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5376
  • Lastpage
    5380
  • Abstract
    Interaction timing in conversation exhibits myriad variabilities, yet it is patently not random. However, identifying consistencies is a manually labor-intensive effort, and findings have been limited. We propose a conditonal mutual information measure of the influence of prosodic features, which can be computed for any conversation at any instant, with only a speech/non-speech segmentation as its requirement. We evaluate the methodology on two segmental features: energy and speaking rate. Results indicate that energy, the less controversial of the two, is in fact better on average at predicting conversational structure. We also explore the temporal evolution of model “surprise”, which permits identifying instants where each feature´s influence is operative. The method corroborates earlier findings, and appears capable of large-scale data-driven discovery in future research.
  • Keywords
    speech processing; automated discovery; conditonal mutual information measure; conversational structure; information-theoretic framework; interaction timing; prosodic cues; speech segmentation; Entropy; Lead; Manuals; Speech; Yttrium; automated discovery; conditional mutual information; interaction modeling; neural networks; speaking rate;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178998
  • Filename
    7178998