• DocumentCode
    1994006
  • Title

    3D Auditory Scene Visualizer with Face Tracking: Design and Implementation for Auditory Awareness Compensation

  • Author

    Kubota, Yuji ; Shiramatsu, Shun ; Yoshida, Masatoshi ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.

  • Author_Institution
    Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
  • fYear
    2008
  • fDate
    15-16 Dec. 2008
  • Firstpage
    42
  • Lastpage
    49
  • Abstract
    This paper presents the design and implementation of 3D Auditory Scene Visualizer based on the visual information seeking mantra, ``overview first, zoom and filter, then details on demand´´. The machine audition system called HARK captures 3D sounds with a microphone array.The natural language processing called SalienceGraph visualizes topic transition by using discourse salience. The 3D visualizer implemented in Java 3D displays topic transition and each sound stream as a beam originating from the microphones (overview mode), shows temporal snapshots with/without specifying focusing areas (zoom-and-filter mode), and shows detailed information about a particular sound stream (details-on-demand mode). This three-mode visualization will give the user auditory awareness enhanced by HARK and SalienceGraph. In addition, a face-tracking system automatically determines the user´s intention by tracking the user´s face. The resulting system will enable users to manage and browse auditory scene files effectively, so it should acceleration and support the information explosion to compensate the lack of auditory awareness.
  • Keywords
    Java; auditory displays; data visualisation; face recognition; microphone arrays; natural language processing; user interfaces; 3D auditory scene visualizer; HARK; Java 3D displays; SalienceGraph; auditory awareness compensation; discourse salience; face tracking; machine audition system; microphone array; natural language processing; three- mode visualization; Acceleration; Explosions; Information filtering; Information filters; Java; Layout; Microphone arrays; Natural language processing; Three dimensional displays; Visualization; Auditory scene visualizer; auditory awareness; computational auditory scene analysis; discourse salience.; face tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Universal Communication, 2008. ISUC '08. Second International Symposium on
  • Conference_Location
    Osaka
  • Print_ISBN
    978-0-7695-3433-6
  • Type

    conf

  • DOI
    10.1109/ISUC.2008.59
  • Filename
    4724440