مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards a computational model of Acoustic Packaging

DocumentCode :

2693178

Title :

Towards a computational model of Acoustic Packaging

Author :

Schillingmann, Lars ; Wrede, Britta ; Rohlfing, Katharina

Author_Institution :

Appl. Inf. Group, Bielefeld Univ., Bielefeld, Germany

fYear :

2009

fDate :

5-7 June 2009

Firstpage :

Lastpage :

Abstract :

In order to learn and interact with humans, robots need understand actions and make use of language in social interactions. The use of language for the learning of actions has been emphasized by Hirsh-Pasek & Golinkoff introducing the idea of acoustic packaging. Accordingly, it has been suggested that acoustic information, typically in the form of narration, overlaps with action sequences and provides infants with a bottom-up guide to attend to relevant events and to find structure within them. Following the promising results achieved by Brand & Tapscott for infants who packaged sequences together when acoustic narration was provided, in this paper, we make the first approach towards a computational model of the multimodal interplay of action and language in tutoring situations. For our purpose, we understand events as temporal intervals, which have to be segmented in both the visual and the acoustic signal in order to perform acoustic packaging. For the visual modality, we looked at the amount of motion over time via a motion history image based approach. The visual signal is segmented by detecting local minima in the amount of motion. For the acoustic modality, we used a phoneme recognizer, which currently segments the acoustic signal into speech and non-speech intervals. Our acoustic packaging algorithm merges the segments from both modalities based on temporal synchrony. First evaluation results show that acoustic packaging can provide a meaningful segmentation of tutoring behavior.

Keywords :

human-robot interaction; intelligent tutoring systems; speech recognition; acoustic modality; acoustic packaging; motion history image based approach; multimodal interplay; phoneme recognizer; social interaction; temporal synchrony; tutoring situation; visual modality; visual signal; Cognition; Cognitive robotics; Computational modeling; Human robot interaction; Image segmentation; Informatics; Packaging; Pediatrics; Speech; Turning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Development and Learning, 2009. ICDL 2009. IEEE 8th International Conference on

Conference_Location :

Shanghai

Print_ISBN :

978-1-4244-4117-4

Electronic_ISBN :

978-1-4244-4118-1

Type :

conf

DOI :

10.1109/DEVLRN.2009.5175523

Filename :

5175523

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2693178