DocumentCode :
2015445
Title :
Auditory augmented reality: Object sonification for the visually impaired
Author :
Ribeiro, Flávio ; Florêncio, Dinei ; Chou, Philip A. ; Zhang, Zhengyou
Author_Institution :
Electron. Syst. Eng. Dept., Univ. de Sao Paulo, Sao Paulo, Brazil
fYear :
2012
fDate :
17-19 Sept. 2012
Firstpage :
319
Lastpage :
324
Abstract :
Augmented reality applications have focused on visually integrating virtual objects into real environments. In this paper, we propose an auditory augmented reality, where we integrate acoustic virtual objects into the real world. We sonify objects that do not intrinsically produce sound, with the purpose of revealing additional information about them. Using spatialized (3D) audio synthesis, acoustic virtual objects are placed at specific real-world coordinates, obviating the need to explicitly tell the user where they are. Thus, by leveraging the innate human capacity for 3D sound source localization and source separation, we create an audio natural user interface. In contrast with previous work, we do not create acoustic scenes by transducing low-level (for instance, pixel-based) visual information. Instead, we use computer vision methods to identify high-level features of interest in an RGB-D stream, which are then sonified as virtual objects at their respective real-world coordinates. Since our visual and auditory senses are inherently spatial, this technique naturally maps between these two modalities, creating intuitive representations. We evaluate this concept with a head-mounted device, featuring modes that sonify flat surfaces, navigable paths and human faces.
Keywords :
acoustic signal processing; audio signal processing; augmented reality; blind source separation; computer vision; handicapped aids; hearing; user interfaces; 3D audio synthesis; 3D sound source localization; 3D sound source separation; RGB-D stream; acoustic virtual objects; audio natural user interface; auditory augmented reality; computer vision methods; flat surfaces; head-mounted device; high-level features of interest; human faces; navigable paths; object sonification; spatialized audio synthesis; virtual objects; visually impaired; Acoustics; Cameras; Encoding; Face recognition; Rendering (computer graphics); Training; Visualization; augmented reality; blind; natural user interface; sonification; spatialization; visually impaired;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing (MMSP), 2012 IEEE 14th International Workshop on
Conference_Location :
Banff, AB
Print_ISBN :
978-1-4673-4570-5
Electronic_ISBN :
978-1-4673-4571-2
Type :
conf
DOI :
10.1109/MMSP.2012.6343462
Filename :
6343462
Link To Document :
بازگشت