• DocumentCode
    178635
  • Title

    Online Non-negative Tensor Deconvolution for source detection in 3DTV audio

  • Author

    Mitsufuji, Yuki ; Liuni, M. ; Baker, Anthony ; Roebel, A.

  • Author_Institution
    Sony Corp., Tokyo, Japan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3082
  • Lastpage
    3086
  • Abstract
    This article describes research on source detection in multichannel (3DTV) audio streams. The problem is highly complex because a scene can contain multiple layers (background music, ambience, commentator). In this work, a new algorithm is developed that exploits information from the different audio channels to detect, and possibly localize and separate, independent audio sources. The algorithm is based on online Non-negative Tensor Deconvolution, which handles sound sources whose positions in the channel matrix vary over time. The evaluation uses 3DTV 5.1 film soundtracks and synthetic mixes of 3DTV 5.1 audio with target sounds from a sound effects database; it shows a significant improvement in detection performance compared with other decomposition techniques.
  • Keywords
    audio signal processing; audio streaming; deconvolution; three-dimensional television; 3DTV audio; audio channel detection; multichannel audio stream; online nonnegative tensor deconvolution; sound source; source detection; Conferences; Databases; Dictionaries; Optimized production technology; Source separation; Tensile stress; Training; Dictionary training; event detection; nonnegative tensor deconvolution
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854167
  • Filename
    6854167