• DocumentCode
    730365
  • Title

    Max-product dynamical systems and applications to audio-visual salient event detection in videos

  • Author

    Maragos, Petros ; Koutras, Petros

  • Author_Institution
    Sch. of ECE, Nat. Tech. Univ. of Athens, Athens, Greece
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2284
  • Lastpage
    2288
  • Abstract
    This paper introduces a theory for max-product systems by analyzing them as discrete-time nonlinear dynamical systems that obey a superposition of a weighted maximum type and evolve on nonlinear spaces which we call complete weighted lattices. Special cases of such systems have found applications in speech recognition as weighted finite-state transducers and in belief propagation on graphical models. Our theoretical approach establishes their representation in state and input-output spaces using monotone lattice operators, finds analytically their state and output responses using nonlinear convolutions, studies their stability, and provides optimal solutions to solving max-product matrix equations. Further, we apply these systems to extend the Viterbi algorithm in HMMs by adding control inputs and model cognitive processes such as detecting audio and visual salient events in multimodal video streams, which shows good performance as compared to human attention.
  • Keywords
    audio-visual systems; convolution; hidden Markov models; matrix algebra; maximum likelihood estimation; speech recognition; transducers; video signal processing; HMM; VIDEOS; Viterbi algorithm; audio-visual salient event detection; belief propagation; cognitive process; complete weighted lattice; discrete-time nonlinear dynamical system; graphical model; input-output space; max-product dynamical system; max-product matrix equation; monotone lattice operator; nonlinear convolution; speech recognition; state space; weighted finite-state transducer; Hidden Markov models; Integrated circuits; cognitive modeling; event detection; lattices; minimax algebra; multimedia signal processing; nonlinear systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178378
  • Filename
    7178378