DocumentCode :
1994006
Title :
3D Auditory Scene Visualizer with Face Tracking: Design and Implementation for Auditory Awareness Compensation
Author :
Kubota, Yuji ; Shiramatsu, Shun ; Yoshida, Masatoshi ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.
Author_Institution :
Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
fYear :
2008
fDate :
15-16 Dec. 2008
Firstpage :
42
Lastpage :
49
Abstract :
This paper presents the design and implementation of 3D Auditory Scene Visualizer based on the visual information seeking mantra, ``overview first, zoom and filter, then details on demand´´. The machine audition system called HARK captures 3D sounds with a microphone array.The natural language processing called SalienceGraph visualizes topic transition by using discourse salience. The 3D visualizer implemented in Java 3D displays topic transition and each sound stream as a beam originating from the microphones (overview mode), shows temporal snapshots with/without specifying focusing areas (zoom-and-filter mode), and shows detailed information about a particular sound stream (details-on-demand mode). This three-mode visualization will give the user auditory awareness enhanced by HARK and SalienceGraph. In addition, a face-tracking system automatically determines the user´s intention by tracking the user´s face. The resulting system will enable users to manage and browse auditory scene files effectively, so it should acceleration and support the information explosion to compensate the lack of auditory awareness.
Keywords :
Java; auditory displays; data visualisation; face recognition; microphone arrays; natural language processing; user interfaces; 3D auditory scene visualizer; HARK; Java 3D displays; SalienceGraph; auditory awareness compensation; discourse salience; face tracking; machine audition system; microphone array; natural language processing; three- mode visualization; Acceleration; Explosions; Information filtering; Information filters; Java; Layout; Microphone arrays; Natural language processing; Three dimensional displays; Visualization; Auditory scene visualizer; auditory awareness; computational auditory scene analysis; discourse salience.; face tracking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Universal Communication, 2008. ISUC '08. Second International Symposium on
Conference_Location :
Osaka
Print_ISBN :
978-0-7695-3433-6
Type :
conf
DOI :
10.1109/ISUC.2008.59
Filename :
4724440
Link To Document :
بازگشت