• DocumentCode
    729723
  • Title

    Visualizing video sounds with sound word animation

  • Author

    Fangzhou Wang ; Nagano, Hidehisa ; Kashino, Kunio ; Igarashi, Takeo

  • Author_Institution
    Univ. of Tokyo, Tokyo, Japan
  • fYear
    2015
  • fDate
    June 29 2015-July 3 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Text captions are important means to provide sound information in videos when the sound is not accessible. However, conventional text captions are far less expressive for non-verbal sounds since they are designed to visualize speech sound. To address this problem, we propose a method for automatically transforming non-verbal video sounds to animated sound words, and positioning them near the sound source objects in the video for visualization. This provides natural visual representation of non-verbal sounds with rich information about the sound category and dynamics. We conducted a user study with over 300 participants using an online crowdsourcing service. The results showed that animated sound words could not only effectively and naturally visualize the dynamics of sound while clarify the position of the sound source, but also contribute to making video watching more enjoyable and increasing the visual impact of the video.
  • Keywords
    computer animation; data visualisation; video signal processing; natural visual representation; nonverbal video sounds; sound word animation; video sound visualization; video watching; Algorithm design and analysis; Animation; Attenuation; Engines; Image segmentation; Support vector machines; Visualization; Sound word; entertainment; environmental sound processing; sound visualization; video annotation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2015 IEEE International Conference on
  • Conference_Location
    Turin
  • Type

    conf

  • DOI
    10.1109/ICME.2015.7177422
  • Filename
    7177422